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EXECUTIVE  SUMMARY 


LIT  Research  Institute  and  Honeywell  SSED  have  teamed  to  uvelop  a  reliability  prediction 
model  for  VHSIC  and  VHSIC-Like  CMOS  devices  with  the  intent  that  the  model  be  suitable  for 
inclusion  into  MIL-HDBK-217.  Traditional  methods  of  reliability  prediction  modeling  have  relied 
on  the  statistical  analysis  of  empirical  field  failure  rate  data.  Since  no  field  data  was  available  for 
this  purpose,  a  reliability  physics  based  modeling  approach  was  used  in  conjunction  with  empirical 
data  from  life  tests,  environmental  tests,  and  test  structures. 

Two  separate  models  were  developed;  a  detailed  model  and  a  short  form  model.  The  detailed 
model  is  based  on  the  characteristics  of  specific  failure  modes,  manufacturer  specific  information 
such  as  defect  density  and  wearout  performance,  and  key  ^.plication  data  including  temperature 
and  operating  time.  The  short  form  model  is  a  condensed  version  of  the  detailed  model  and  does 
not  require  manufacturer  specific  information  but  rather  easily  accessible  information.  The  penalty 
in  using  the  short  model  is  its  lower  precision  and  accuracy  relative  to  tlse  detailed  model. 

In  addition  to  the  data  supplied  by  Honeywell  to  tins  effort,  a  database  was  built  containing 
life  test,  bum-in,  and  environmental  test  results  from  a  variety  of  manufacturers.  Much  of  die  data 
contained  in  this  database  was  used  in  the  quantification  of  early  life  failure  rates  for  various 
specific  failure  mechanisms,  'therefore  the  detailed  model  predicting  defect  related  early  life  failure 
rates  will  yield  an  industry  wide  representative  failure  rate.  It  was  also  determined  in  this  study 
tliat  it  is  tiie.se  defect  related  mechanisms  that  drive  the  failure  rate  in  the  part’s  useful  life.  Intrinsic 
wearout  mechanisms  have  also  been  modeled  which  will  provide  an  approximate  end  of  life  time  as 
a  function  of  tire  parts  design  rules  and  its  particular'  application. 

There  was  also  difficulty  in  predicting  failure  rates  due  to  event  driven,  design  related  failure 
mechanisms  such  as  electrosfatic  discharge  (ESD)  and  CMOS  latch-up.  Event  statistics  are 
generally  not  available,  and  robustness  varies  greatly  among  products  and  manufacturers.  This  can 
lead  to  very  unpredictable  application  problem  often  resolved  by  reliability  engineers. 


Tit.  detailed  model  has  ten  validated  with  the  life  test  data  that  was  available  on  1 .0  and  1 .2 
micron  processes  from  three  separate  manufacturing  processes.  It  was  also  observed  that  there 
were  relatively  large  variations  in  observed  failure  rates  between  manufacturers.  The  model 
partially  accounts  for  these  variations  by  including  fabrication  specific  information  such  as  defect 
density  and  wearout  data. 


i 


Another  difficulty  was  the  large  number  of  failures  reported  for  which  the  failure  mechanism 
was  unknown,  implying  some  level  of  uncertainty  regarding  the  completions  of  the  present 
models. 

It  is  quite  apparent  from  the  results  of  this  study  that  a  total  quality  management  approach 
presents  demands  which  go  beyond  traditional  statistical  process  control,  which  seeks  stabilization 
of  process  outcomes,  monitored  in  terms  of  measured  quantities  such  as  film  thicknesses,  lateral 
dimensions,  and  electrical  parameters.  Clearly,  an  additional  important  requirement  exists  for 
monitoring  and  reducing  various  process  defect  densities.  This  will  help  to  reduce  the  number  of 
products  which  contain  the  types  of  defects  known  to  cause  early  life  failures.  However, 
achieving  tire  highest  attainable  quality  in  an  environment  where  defects  are  infrequent  and 
randomly  placed  implies  an  obvious  need  for  100%  product  level  electrical  testing  for  time  zero 
failures  caused  by  defects,  as  well  as  100%  product  level  reliability  screening  for  early  life  failure 
mechanisms  influenced  by  defects. 

This  effort  also  attempted  to  utilize  data  from  ongoing  efforts  such  as  the  yield  enhancement 
and  generic  qualification  programs.  It  was  unfortunately  concluded  that  die  level  of  standardization 
(between  manufacturers)  necessary  for  input  to  a  reliabilkj  model  has  not  yet  been  achieved. 
Therefore,  data  from  these  efforts  are  used  as  an  input  to  the  model  in  a  more  qualitative  sense  in 
which  case  a  multiplication  factor  is  applied  modifying  the  failure  rate. 

In  summary,  it  is  recognized  that  there  is  no  perfect  reliability  model  covering  all  CMOS 
VLSI  devices  from  a  variety  of  manufacturers.  Given  this,  the  goal  of  the  model  is  to  strive  for 
accuracy  for  most  devices,  manufacturers,  and  applications.  It  is  the  view  of  IITRI  and  Honeywell 
that  tire  model  meets  its  goals  in  that  it  is  sensitive  to  lire  proper'  factors  effecting  reliability  in  the 
proper  proportion. 
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EVALUATION 


The  objective  of  this  effort  was  to  develop  a  reliability  prediction  model  for  fielded 
CMOS  VHSIC  and  VHSIC-Like  devices.  Since  little  or  no  field  reliability  data  was 
available,  an  approach  was  taken  that  used  methods  which  deviated  from  the  traditional 
statistical  analysis  of  field  failure  rate  data.  The  effort  was  successful  in  accomplish¬ 
ing  this  objective  with  the  development  of  two  models,  a  detailed  model  and  a  short 
form  model,  for  predicting  failure  rates  for  VHSIC  and  VHSIC-Like  CMOS  microcir¬ 
cuits. 

The  detailed  model  is  based  on  the  characteristics  of  specific  failure  modes,  manufac¬ 
turer  specific  information  such  as  defect  density  and  wearout  pei  ■  mance,  and  key 
application  data  including  temperature  and  operating  time.  The  short  form  model  is  a 
condensed  version  of  the  detailed  model  and  does  not  require  manufacturer  specific 
information,  but  rather  easily  accessible  information.  The  penalty  in  using  the  short 
model  is  its  lower  precision  and  accuracy  relative  to  the  detailed  model. 

The  models  account  for  both  time  dependent  and  defect-related  failure  mechanisms.  A 
data  base  was  built  containing  the  life  test,  burn-in  and  environmental  test  results  from 
a  variety  of  manufacturers.  Much  of  the  data  contained  in  this  data  base  was  used  in 
the  quantification  of  early  life  failure  rates  for  various  specific  failure  mechanisms. 
Therefore  the  detailed  model,  in  predicting  defect-related  early  life  failure  rates,  will 
yield  an  industry  wide  representative  failure  rate.  The  use  of  actual  defect  densities,  if 
properly  measured,  will  result  in  predicted  reliability  values  which  are  more  precise  and 
accurate  than  conventional  regression  type  prediction  models. 

It  was  also  determined  in  this  study  that  it  is  these  defect-related  mechanisms  that 
drive  failure  rate  in  the  parts  useful  life.  Wearout  mechanisms  have  also  been  model  id 
which  will  provide  an  approximate  end  of  lifetime  as  a  function  of  the  parts  design  rul<,v 
and  its  particular  application. 

The  model  addresses  three  time-dependent  mechanisms}  electromigration,  time-depen¬ 
dent  dielectric  breakdown,  and  hot  "tarrier  effects.  The  model  has  factors  for  chip  area, 
defect  density  and/or  minimum  feature  size  so  that  changes  in  technology  can  readily 
be  factored  in.  It  has  a  correction  factor  to  modify  the  model  as  VHSiC  field 
experience  becomes  available  and  to  modify  the  m^dei  for  a  particular  fabrication 
process  based  on  the  availability  of  high  quality  life  tests.  The  model  can  also  utilize 
test  pattern  data  from  manufacturers  in  conjunction  with  the  Yield  Enhancement  and 
Generic  Qualification  programs.  There  is  a  package  factor  which  considers  the  number 
of  package  pins  and  includes  the  following  package  types:  PIN  Grid  Arrays,  Chip 
Carriers,  and  Dual-in-Line  Packages.  It  also  has  factors  for  EOS/ESD  and  whether  or 
not  the  device  is  on  the  QPL/QML. 

The  detailed  model  has  been  validated  with  the  life  test  data  that  was  available  on  1.0 
and  1.25  micron  feature  size  devices  from  three  separate  manufacturing  processes.  The 
models  will  be  proposed  for  Inclusion  In  M1L-HDBK-217  "Reliability  Prediction  of 
Electronic  Equipment." 

PETER  l\  MANNO 
Reliability  Assurance  Branch 
Microelectronics  Reliability  Division 
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1.0  INTRODUCTION 


The  intent  of  this  effort  was  to  develop  a  reliability  prediction  model  for  fielded  CMOS  VHSIC 
and  VHSIC-Like  integrated  circuits.  Since  no  field  reliability  data  on  these  circuit  types  was 
anticipated  for  development  of  these  models,  methods  were  used  that  deviated  from  the  traditional 
statistical  analysis  of  field  failure  rate  performance  of  the  device  types  being  modeled. 

Since  integrated  circuit  technology  is  advancing  at  a  pace  more  rapid  than  our  ability  to  collect 
field  data  and  build  models  accordingly,  tills  program  also  represented  an  effort  to  develop 
reliability  prediction  model  development  methodologies  which  alleviate  the  need  for  extensive 
quantities  of  field  experience. 

It  was  also  an  intent  of  this  study  is  to  make  maximum  use  of  the  voluminous  amounts  of  work 
done  and  being  done  in  the  area  of  high  density  CMOS  reliability  assurance.  Data  was  collected 
during  this  program  from  Honeywell  and  other  semiconductor  manufacturers  willing  to  participate. 
This  approach  assured  that  the  prediction  models  developed  are  not  valid  only  for  Honeywell 
devices,  but  rather  models  that  are  reflective  of  the  entire  VHSIC  and  VHSIC-Like  CMOS 
industry. 

The  general  failure  rate  model  was  developed  using  data  of  various  manufacturers  ami  therefore 
calculates  an  average  expected  failure  rate.  In  reality,  large  variations  in  failure  rates  are  observed 
between  manufacturers.  In  most  cases  the  root  cause  for  this  difference  cannot  be  modeled  in  a 
ger^ral  reliability  prediction  model.  Attempts  were  made  to  address  this  difference  by  including 
factors  such  as  defect  density,  however,  there  are  still  many  very  subtle  factors  which  heavily 
influence  reliability,  many  of  which  are  not  fully  understood,  much  less  quantifiable.  It  should  be 
noted,  however,  that  the  model  is  applicable  only  for  a  relatively  mature  technology  that 
consistently  yields  acceptable  die  from  every  completed  wafer. 

It  is  also  noted  that  other  factors  may  strongly  influence  reliability  more  broadly  defined  as 
trouble  free  application  of  products  in  real  systems.  Design  related  overstress,  ESD  and  latch-up 
robusutess  fall  into  this  category,  and  these  may  vary  greatly  among  products  and  manufacturers. 
These  are  also  event  driven  failure  mechanisms,  and  failure  rate  prediction  would  require 
knowledge  of  event  statistics  as  well  as  device  robustness. 
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Additionally,  although  this  is  primarily  a  theoretical,  bottom  up  approach  to  reliability  model 
building,  this  study  attempted  to  maximize  the  use  of  empirical  data,  on  both  specific  failure 
mechanisms  and  the  entire  device.  This  represents  a  radical  departure  from  traditional  failure  rate 
modeling  methods  since  it  offers  the  potential  of  using  data  on  specific  failure  mechanisms  (from 
test  structures,  life  tests,  etc.),  whereas  historical  methods  have  required  empirical  data  on  the 
entire  packaged  part 

Also  recognized  in  this  effort  is  the  need  for  various  types  of  models  filling  the  needs  of 
specific  users.  For  example,  to  fully  utilize  the  knowledge  base  of  large  scale  CMOS  reliability,  a 
model  which  uses  detailed  physical  parameters  of  the  circuits  being  fabricated  is  required. 
Additionally,  the  need  is  also  recognized  for  a  short,  easy  to  use  model,  capable  of  rapidly 
providing  reliability  estimates.  This  effort  addressed  the  needs  of  these  two  user  types  by 
providing  both  a  detailed  model  and  a  short  form  model. 
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2.C  REPORT  ORGANIZATION  AND  NOTATION 


The  remainder  of  this  report  is  organized  as  follows: 

«  Section  3.0  presents  the  approach  taken  in  this  study  to  meet  the  stated  objectives.'  It 
summarizes  the  methodology  used  to  separate  failure  rate  contributions  from  individual 
failure  mechanisms  and  also  how  the  early  life  and  wearout  failure  rates  were  quantified. 
It  also  discusses  the  relationship  of  this  effort  to  related  efforts  such  as  the  yield 
enhancement  and  generic  qualification  programs. 

•  Section  4.0  discusses  the  database  that  was  built  for  this  effort,  the  manner  in  which  data 
was  collected,  and  profiles  of  the  database  regarding  the  type  and  complexities  of  devices 
the  data  was  collected  from. 

•  Section  5.0  is  the  main  section  of  this  report  which  presents  the  derivation  of  failure  rate 
models  for  each  failure  mechanism  that  was  modeled.  Failure  rate  contributions  were 
derived  and  summarized  in  this  section  for  oxide,  metal,  hot  carriers,  contamination, 
package  related  failures,  electrical  overstress,  and  a  miscellaneous  category.  Section  5.0 
also  presents  a  detailed  discussion  of  the  relationships  between  yield,  defect  density,  die 
area,  and  device  type,  and  presents  the  rationale  used  in  this  effort  for  making  die  model  a 
function  of  tire  defect  density/area  product. 

•  Section  6.0  presents  the  complete  detailed  version  of  the  model,  inclusive  of  all  failure 
rates  modeled  Due  to  die  relative  complexity  of  die  model,  die  calculations  from  it  are 
tedious.  Therefore,  a  computer  program  was  written  to  facilitate  calculations.  In  this 
computer  program,  the  user  simply  enters  the  desired  input  variables  and  can  then 
perform  a  prediction  at  a  single  instant  in  dme  or  can  choose  the  charting  option  diat  will 
calculate  the  failure  rate  as  a  function  of  time  and  plot  die  resulting  values. 

•  Section  7.0  presents  the  derivation  of  each  factor  in  die  short  form  model  along  with  a 
summarization  of  the  complete  sliort  model. 
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Section  8.0  discusses  modeling  considerations  given  to  fault  tolerant  designs.  Since 
there  are  too  many  design  possibilities  to  adequately  model  fault  tolerance  with  a  single 
factor,  there  is  no  factor  in  the  model,  but  rather  guidelines  are  presented  to  model  its 
effect  based  on  a  detailed  knowledge  of  the  device  architecture  and  d  jsign.  Examples  of 
fault  tolerant  techniques  and  guidelines  to  model  them  are  presented  in  this  section. 

Section  9.0  presents  a  summary  of  the  1.0  and  1.2  micron  data  that  was  used  in  the 
model  validation  phase  of  this  effort,  and  a  summary  of  how  the  predicted  values 
compared  to  the  observed.  It  also  discusses  the  variation  in  failure  rates  observed  as  a 
function  of  the  manufacturing  process  and  presents  "  ome  data  on  the  accuracy  that  can  be 
expected  from  the  prediction. 

Section  10.0  presents  sample  calculations  for  18  separate  combinations  of  input 
variables,  and  gives  plots  of  predicted  failure  rate  as  a  function  of  time. 

Section  1 1.0  discusses  briefly  how  field  data,  when  available,  can  be  used  to  modify  the 
model 


Section  12.0  discusses  conclusions  and  recommendations  regarding  this  modeling  effort. 


NOTATION  AND  DEFINITIONS 


A 

Ag 

AE^vx 

Aw 


=  Gate  Oxide  Area  as  Used  in  Section  5.1.2.3  and  Total  Die  Area  as  Used 
Elsewhere 

=  Constant  in  Black's  Equation 
=  Acceleration  Due  to  the  Electric  Field 
=  Acceleration  Factor  Due  to  the  Oxide  Electric  Field 


aKC 


AHE 


Art 

Ajv 

Ar 

% 


athc 


-  Magnitude  of  Hot  Carrier  Degradation 

~  Avalanche  Hot  Electn  n 
=  Oxide  Area  of  Chip  N 

a  Oxide  Area  of  Chip  0  for  which  Reliability  is  to  be  Extracted  from  Data  on 
Test  Structure  Oxide  with  on  a  Ag 

a  Reference  Die  Area 
«  Acceleration  Due  to  Temperature 

«  Temperature  Acceleration  Factor  for  Contarainatioti  Related  Failures 
»  Temperature  Acceleration  Factor  for  Hot  Carrier  Degradation 
=  Temperature  Acceleration  for  the  Metal  Failure  Rare 
«  Tanperature  Acceleration  Factor  for  Miscellaneous  Failure 


ATgjj  »  Temperature  Acceleration  Factor  for  lire  Oxide  Failure  Rate 

ATYFEmET  ”  A  constant  whicli  accounts  for  tire  relative  differences  in  metal  lengtlis 
between  various  device  types 


AtYPEox  =  A  constant  which  accounts  for  the  relative  differences  in  oxide  area  densities 
between  various  device  types 

AYqx  =  Oxide  Electric  field  acceleration  factor 

C  =  Constant 

CDIP  -  Ceramic  DIP 

CLCC  as  Ceramic  Leadless  Chip  Carrier 

CPGA  -  Ceramic  Pin  Grid  Array 

1X3  =  Duty  Cycle,  %  Operating  Time 

Djvpp  =  Effective  Defect  Density 


DIP 

Dn 

*>0 

% 

Ds 


a  Dual  In  Line  Package 

«  Oxide  Defect  Density  of  Citip  0  (Used  only  in  Section  5.1 .2.3) 

«  Oxide  Defect  Density  as  used  in  Section  5. 1 .2.3,  and  Critical  Defect  Density  for 
Feature  Size  Xq  Elsewhere 

»  Oxide  Defect  Density 

*  Metal  Defect  Density 

a  Reference  Defect  Density 

«  Defect  Density  of  a  Test  Structure 


D(w)  Area  Density  with  Weakness  Factor  Larger  than  w 

E  «  Gate  Oxide  Breakdown  Strength 
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Ea  =  Activation  Energy,  Per  the  Arrhenius  Relationship  (in  Electron  Volts,  eV) 

Ec  =  Normalizing  Electric  Field 

EOS  =  Electrostatic  Discharge 

EqX  =  Electric  Field  Strength  in  the  Oxide  (MV/cm) 

Eref  =  Reference  Electric  Field 

ESD  =  Electrostatic  Discharge 

Es  =  Actual  Electric  Field 

FIT  »  Failure  Unit  (Failures/lO^  hours) 

FO  as  Fallout  Rate  (Percentage) 

FOr  “  Reference  Fallout  Rate 

f(t)  a  Time  to  Failure  Probability  Density  Function 

F(x)  a  Cumulative  Probability  Distribution  as  a  Function  of  x 

G  *s  Process  Specific  Constant 

h(t)  =*  Hazard  Rate 

%  «  Drain  Current 

%UB  e  Substrate  Current 


Current  Density 


K 


=  Boltzmaa's  Constant » 8.65  x  10 


5g) 
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ldd 

m 

n 

P 

PDIP 

P(0 

i\m 

PGA 

PLCC 

PPGA 

P(w) 

QML 

Qn(0 

Qq(0 

0.(0 

0(0 

R 

r 


=  Low  Dose  Drain 

=  Exponent  for  Isu^  Term  in  Hot  Carrier  Model 

=  Exponent  of  Current  Density  for  Black's  Equation  in  Section  5.2  and 
Exponent  for  Hot  Carried  Degradation  in  Section  5.3 

=  Power 

=  Plastic  DIP 

=  Probability  of  Failure  due  to  EOS  or  ESD 
a  Probability  of  Failure  given  Contact  from  an  EOS/ESD  Source 
=  Pin  Grid  Array 
»  Plastic  Lcadlcss  Chip  Carrier 
=  Plastic  Pin  Grid  Array 

»  Probability  of  containing  at  least  one  defect  with  weakness  factor  w 

*a  Qualified  Manufacturers  list 
«  Cumulative  Failure  Density  ot  Grip  N 

»  Cumulative  Failure  Distribution  for  an  Oxide  with  Defect  Density  Dn 
ca  Cumulative  Failure  Density  for  a  Test  Structure 

»  Cumulative  Failure  Density  Function 
®  Duty  Ratio  for  Substrate  Current 
»  Duty  Cycle 
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RH 

=  Relative  Humidity 

RHeff 

=  Effective  Relative  Humidity 

R(t) 

=  Reliability  Function 

s 

=  Shape  Parameter  Used  in  tlie  Stapper  Yield  Model 

SD 

=  Single  Drain 

SvT 

=  Constant 

SV 

=  Standard  Derivation 

T 

»  Temperatme 

t 

“  Time 

=  Time  to  Breakdown 

T 

*C 

a  Case  Temperature 

1DDB 

a  Time  Dependent  Dielectric  Breakdown 

Tj 

-  Junction  Temperature 

To 

«  Reference  Tcmparatme 

«o 

»  Effective  Screening  Time 

*50 

a=  Time  at  which  50%  of  the  Population  Fails 

*SGr ef 

-  Reference  150  Ym» 

v<i 

=>  Drain  Voltage 

9 


VGS 

VSD 


VTH 

*0 

*s 

yn 


AH 

%A 


X 

h 


XBP 

^CON 

*EOS 

*esd 

bm 


-  Gate  Source  Voltage 
=  Source  Drain  Voltage 
=  ESD  Failure  Threshold  Voltage 
=  Reference  Feature  Size 
=3  Actual  Feature  Size 
=  Constant 

»  Constant 
»  Activation  Energy 
«  Junction-Ambient  Thermal  Resistance 
w  Junction-Case  llternwl  Resistance 

ss  Failure  Rata  (in  Failures  per  Million  hours) 

«*  Base  Failure  fate 

*»  Base  lockage  Failure  Rate 

»  EOS/ESD  Contact  Rate 

*>  Coniaxainatioa  Failure  Rata 

&  Electrical  Ovetsiress  Failure  Rate 

•»  E*xumtaiicD^^ 

»  Metal  Failure  Rale 
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*MIS 

s 

Miscellaneous  Failure  Rate 

^ox 

= 

Oxide  Failure  Rate 

^OxW 

Time  Dependent  Oxide  Failure  Rate 

^PAC 

Package  Failure  Rate 

*P 

= 

Predicted  Failure  Rate 

XpH 

» 

Package  Henneticity  Failure  Rate 

m 

C2 

Failure  Rate  as  a  Function  of  Time 

He 

33 

Field  Data  Gorrecdon  Factor 

Hcd 

*3 

F  e  Complexity  Factor 

% 

S3 

Environment  Factor 

%FG 

W 

Manufacturing  Process  Correction  Factor 

%T 

S3 

Package  Type  Factor 

Hq 

S3 

Quality  Level  Factor 

n$P 

S3 

Package  Screening  Factor 

■Jss 

Standard  Deviation  forKcrmctidty  Related  Failure  Mechanisms 

^HC 

as 

Standard  Deviation  for  die  distribution  of  Hot  Carrier  Failures 

sr 

Standard  Deviation  for  the  metallization  electremigmtion  failure  distribution 

n 


Cqj,  =  Standard  Deviation  for  the  TDDB  Failure  Distribution 

x  =  Exponential  Time  Constant 

Xjj£>  =  Hot  Carrier  Lifetime 


3.0  APPROACH 


The  approach  taken  in  this  effort  was  to  analyze  each  failure  mechanism  independently  and 
develop  a  separate  failure  rate  ex^ession  for  each.  This  approach  assumes  that  each  failure 
mechanism  is  independent  of  each  other  and  one  is  not  the  result  of  the  other.  While  in  all  cases 
this  is  not  entirely  true,  it  represents  a  justifiable  approximation  when  considering  the  limited 
prediction  accuracy  expected  from  such  a  model.  Figure  3-1  summarizes  the  methodology  used  in 
this  study. 


FIGURE  3-1: 

MODEL  DEVELOPMENT  METHODOLOGY 
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The  first  step  in  this  process  was  to  identify  as  many  potential  failure  mechanisms/modes  as 
possible.  These  failure  mechanisms  are  listed  as  follows: 

Oxide  Failures 

Time  Dependent  Dielectric  Breakdown 

Metallization  Failures 

Electromigration 

Hot  Carriers 

Corrosion 

Ionic  Contamination 
Wire  Bond  Failures 
Package  Hermeucity  Failures 
Electrical  Overstress  (including  ESD) 

Die  Bond  Failures 
Soft  Errors 
Latch  Up 

Next,  each  of  these  were  studied  to  determine: 

(1 )  Prevalence  in  causing  failure  of  small  feature  size  CMOS 

(2)  Time  to  failure  characteristics 

(3)  How  well  they  can  be  screened 

(4)  The  factors  and  stresses  which  influence  their  failure  duuacteristics 


Since  there  are  many  more  failure  mechanisms  possible  titan  can  effectively  be  quantified  in  a 
single  model,  it  was  tltc  objective  of  this  study  to  choose  only  those  failure  mechanisms  that  drive 
of  dominate  the  failure  rate  arid  model  them  accordingly. 

To  accomplish  tins  and  to  identify  time  to  failure  characteristics,  a  database  was  developed 
from  screening  and  life  test  data  of  VHSXC-Uke  devices.  The  physics  of  failure  characteristics  of 
each  mechanism  were  studied  and  both  the  fabrication  variables  and  stresses  that  affect  each 
mechanism  were  cbtettmned. 


Next,  a  failure  rate  as  a  function  of  time  was  derived  for  each  mechanism  based  ei  ther  on 
empirical  data  from  the  database  or  based  on  theoretical  considerations.  These  failure  rate'  were 
then  comlaaed  into  a  cohesive  model 
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The  basic  premise  of  the  approach  used  in  this  effort  was  that  die  related  failure  mechanisms 
are  predominantly  accelerated  by  temperature,  voltage  and  current  and  package  related  mechanisms 
are  predominantly  accelerated  by  environmental  stresses  (primarily  temperature  cycling). 

Table  3-1  summarizes  the  number  of  observed  failures  contained  in  the  database  as  a  function 
of  test  type  and  failure  class.  This  data  supports  the  approach  being  taken  of  accelerating  die 
related  failure  mechanisms  with  temperature  and  package  related  mechanisms  with  temperature 
cycling. 


TABLE  3-1: 

DATABASE  PROFILE:  NUMBER  OF  OBSERVED  FAILURES 


Test  Type 

Failure  Gass 

Life  Test 

■jgsgi 

Misc. 

Environmental 

Oxide 

274 

7 

0 

Metal 

62 

3 

6 

Package 

24 

214 

0 

Assembly 

21 

0 

0 

Contamination 

603 

5 

0 

Unknown 

789 

54 

4 

Each  known  failure  mechanism  was  analyzed,  determining  characteristics  of  infant  mortality, 
weaiout,  random  failures,  or  some  combination  thereof. 

Each  failure  mechanism  will  tlierefore  have  either  an  early  life  (decreasing  failure  rate)  term,  a 
wcarout  failure  rate  term  (based  on  the  lognormal  time  to  failure  distribution),  a  time  independent 
(event  related)  failure  rate  term,  or  a  combination  of  tire  three.  Table  3-2  summarizes  each  failure 
mechanism  addressed  in  the  detailed  model  and  their  associated  failure  rate  tenn(s). 
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TABLE  3-2: 

FAILURE  MECHANISM  CONTRIBUTION  TERMS 


mn 

WBSSBM 

Long  Term 

Wearout  X 

Time  Independent 

X 

Oxide 

X 

X 

Note  1 

Metal 

X 

X 

Hot  Carriers 

X 

Note  1 

Contamination 

X 

Package 

X  (for  non- 

X 

hermetic  only) 

Electrical 

Overstress 

X 

Miscellaneous 

X 

X 

Note  1:  Very  recent  information  suggests  that  ESD  overstress  can  adversely  affect  both 
oxide  wearout  and  hot  carrier  degradation  rates.  A  time  independent  factor  might 
be  appropriate,  depending  on  whether  further  work  relates  this  effect  to  early  life 
failures. 

Each  entry  in  Unis  table  represents  an  additive  failure  rate  in  the  model  and  each  of  these 
failure  rates  are  described  separately  in  subsequent  sections  of  this  report. 

The  short  term  decreasing  failure  rates  for  oxide,  metal,  contamination,  and  miscellaneous 
Mure  mechanisms  are  essentially  defect  related  mechanisms  that  have  tiro  potential  of  being 
screened  out  For  these,  the  time  to  failure  data  in  tire  database  developed  for  this  study  was 
utilized  to  develop  empirical  failure  rate  relationships  as  a  function  of  time. 

The  long  term  wearout  failure  rates  are  theoretically  derived  based  on  various  sources 
including  data  reported  in  the  literature,  specialised  test  results,  and  asst  structure  results.  These  are 
all  based  on  the  lognormal  time  to  failure  distribution  and  will  predict  an  approximate  time  at  which 
the  ltd  of  life  of  that  part  is  approaching.  For  a  well  designed  part,  the  failure  rare  contribution 
from  these  mechanisms  should  be  essentially  zero  in  the  useful  life  of  the  part,  indicating  that  the 
Mure  rate  is  driven  by  tire  defect  related  mechanisms  as  well  as  the  time  independent,  event  related 


The  time  independent  mechanisms  are  those  that  are  primarily  event  related,  such  as 
electrostatic  discharge.  The  package  failure  rate  is  treated  in  our  model  as  a  time  independent 
mechanism,  however,  there  is  a  time  dependent  package  related  contribution  in  the  miscellaneous 
term.  The  rationale  for  this  is  further  discussed  in  subsequent  sections  of  tliis  report. 

In  failure  mechanisms  that  exhibit  both  a  short  term  failure  rate  and  a  wearout  failure  rate,  the 
defect  severity  distribution  plays  an  important  role  in  the  short  and  long  term  failure  characteristics. 
This  defect  severity  distribution  relates  the  severity  of  defects  to  its  prevalence  in  occurring. 

Figure  3-2  represents  a  hypothetical  case  illustrating  an  oxide  with  various  defect  severities 
(from  Reference  8).  This  example  relates  these  severities  to  the  expected  time  of  failure. 


Oxide 


Defect  Severity 


2  3  4  5 


1 .  -  Yield  loss  at  wafer  probe 

2.  -  Bum-In  Failure 

3 .  -  Failure  during  equipment  checkout 

4.  -  Failure  during  first  year  of  operation 

5 .  -  Failure  after  4  years  of  operation 


FIGURE  3-2: 

DEFECT  SEVERITY  EXAMPLE 


In  the  contained  model  herein,  2,  3  and  4  are  modeled  with  the  short  term  failure  rate 
contribution  and  5  is  modeled  with  the  terns  for  long  term  wearout.  1  is  yield  loss  and  is  not 
explicitly  addressed  in  die  modal. 


This  modeling  approach  has  several  advantages: 


(1)  By  separating  the  failure  rates  due  to  each  failure  mechanism,  the  model  can  be  more 
sensitive  to  stresses  which  affect  each  mechanism  differently.  For  example,  each 
mechanism  typically  will  have  a  unique  temperature  activation  energy,  which  can  be 
modeled  separately,  instead  of  choosing  an  average  activation  energy  which  would  result 
in  an  oversimplified  approximation  of  the  effect  of  temperature. 

(2)  It  allows  the  user  to  quantitatively  ascertain  the  benefit  of  screening  temperature  and 
duration.  Since  the  predicted  failure  rate  is  a  function  of  time,  the  user  can  model  a 
typical  failure  rate  improvement  by  trying  various  temperatures  and  durations. 

(3)  It  gives  a  better  representation  of  the  behavior  of  the  failure  rate  as  a  function  of  time  and 
also  provides  a  realistic  estimate  of  how  long  the  devices  can  be  expected  to  last  in  fielded 
systems. 

It  would  have  been  very  advantageous  to  this  effort  if  it  were  possible  to  identify  and  rank  the 
significant  failure  modes  and  mechanisms.  Unfortunately,  the  number  of  variables  effecting  the 
prevalence  of  each  failure  mode  and  mechanism  is  very  high  and  therefore  this  ranking  would  only 
be  valid  for  the  device  whose  data  was  used  in  deriving  the  ranking.  In  other  words,  one  CMOS 
process  may  have  one  particular  failure  mechanism  prevalent,  whereas  another  process  may  have 
different  mechanisms  prevalent.  This  is  due  to  the  many  variables  controlling  the  failure  mode 
distribution.  To  attempt  to  quantify  this  variability*  a  survey  was  issued  to  various  CMOS  VLSI 
manufacturers  in  which  they  were  asked  to  give  percentages  of  failures  they  observe  for  each 
failure  mechanism  listed.  This  survey  questionnaire  is  given  in  Appendix  A.  Table  3-3 
summarizes  the  seven  responses  of  this  survey  regarding  the  failure  mode  distributions,  and 
indicates  a  wide  variation  in  the  prevalence  of  the  observed  failure  mechanisms.  These  results 
illustrate  the  difficulty  in  deriving  a  model  based  on  expert  opinion. 
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TABLE  3-3: 

FAILURE  MECHANISMS  DISTRIBUTIONS 


Failure  Mode/Mechanism 

1 

2 

Survey  Responses 

3  4  5 

6 

7 

Electromigration 

13% 

X 

Dielectric  Breakdown 

X 

50% 

<.1% 

98% 

- 

2% 

X 

Soft  Errors 

- 

- 

- 

- 

- 

- 

Parametric  Drift 

X 

- 

.1% 

- 

- 

38% 

X 

Hot  Elections 

- 

- 

- 

- 

- 

»  * 

Latch  Up 

X 

10% 

.1% 

- 

X 

- 

X 

Electrical  Overstress 

X 

20% 

2% 

X 

- 

Package  Related 

- 

20% 

<.1% 

- 

1 

28% 

SB 

Other 

" 

• 

* 

B 

19% 

■ 

|  x  »  Failure  Mode  occurs  but  no  percentage  given  in  survey  response. 


3.1  MODELING  TIME  TO  FAILURE  CHARACI’ERTSTTCS 

It  was  originally  intended  to  quantify  a  bimodal  distribution  for  cadi  failure  mechanism  For 
example,  Figure  3-3  illustrates  the  bimodal  distribution. 
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FIGURE  3-3: 

BIMODAL  DISTRIBUTION 

Where  the  ratio  of  the  area  under  each  curve  are  derived  from  the  data  contained  in  the 
database  developed  for  this  effort.  For  example,  if  the  database  indicates  that  1%  of  the  population 
of  devices  fail  in  screening  tests  for  a  specific  failure  mechanism,  then  the  area  under  the  first  curve 
will  be  1%  of  the  total. 

In  the  case  of  ionic  contamination,  screening  (or  test)  effectiveness  is  very  high,  and  there  are 
no  wearout  failures  expected  (i.e.,  all  infant  mortality  failures),  the  time  to  failure  distribution  in 
Figure  3-4  will  result  in  the  case  where  1%  of  the  population  fails  in  screening  from  contamination. 
Therefore,  this  is  not  the  probability  density  function  for  the  entire  part  but  only  for  contamination. 
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t  0 


1 1 


FIGURE  3-4: 

FIRST  MODE  OF  THE  BIMODAL  DISTRIBUTION 

In  Figure  3-4  the  short  term  screening  failures  are  expected  but  no  wearout  failures  are 

expected  due  to  the  characteristics  of  contamination.  If  a  device  has  been  exposed  to  screens  for  an 
equivalent  time  of  tj,  then  the  initial  time  of  field  operation  would  be  tl.  If  it  has  not  been 

screened,  then  the  initial  time  of  field  operation  would  begin  at  to  and  1%  of  the  population  (i.e., 
all  contamination  failures)  would  be  expected  to  fail  between  to  and  ti  from  contamination. 

It  became  apparent  that,  while  the  wearout  portion  of  the  distribution  could  be  fairly  well 
defined,  the  early  life  defect  related  portion  could  not  be  modeled  as  an  increasing  and  then 
decreasing  distribution. 

Also,  although  the  time  to  failure  characteristics  were  analyzed,  all  models  were  developed  in 
a  hazard  rate  format,  instead  of  time  to  failure,  to  be  more  consistent  with  conventional  reliability 
prediction  methodologies,  and  to  facilitate  failure  rate  predictions. 
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3.1.1  Development  Methodology 


From  the  failure  mechanisms  observed  in  the  database,  there  are  very  few  observances  of  die 
failures  occurring  from  the  wearout  failure  mechanisms  (electromigration,  TDDB  (time  dependent 
dielectric  breakdown),  Hot  Carriers,  and  moisture  related  failures  in  nonhermetic  packages).  For 
example,  the  vast  majority  of  failures  in  an  oxide  are  not  time  dependent  dielectric  breakdown,  but 
rather  other  early  life  oxide  failure  mechanisms,  possibly  due  to  defects  more  severe  than  those  that 
are  manifested  as  TDDB  failures. 

To  account  for  these  failures,  an  exponential  hazard  rate  model  was  chosen  and  fit  to  the 
observed  failure  rates  (as  a  function  of  time)  for  each  applicable  failure  mechanism.  For  example, 
the  following  basic  model  was  used: 


ko=v“ 


where: 


is  the  base  failure  rate  (constant) 
x  is  the  time  constant  of  the  exponential 
t  is  time 

The  rime  to  failure  data  contained  in  the  database  was  used  to  derive  the  observed  failure 
rates,  as  a  function  of  time  and  all  were  normalized  to  a  25°C  temperature  by  multiplying  the  actual 
time  by  the  acceleration  due  to  temperature  (Ay)  (see  Section  5. 1.2-1).  This  acceleration  was 

between  the  actual  temperature  and  25°C.  The  failure  mechanisms  for  which  early  life  failure  rates 
woo  derived  are;  Metal,  Oxide,  Contamination,  and  a  Miscellaneous  category  (containing  various 
time  dependent  assembly  and  package  related  mechanisms).  Since  package  related  failure 
mechanisms  accelerated  by  temperature  cycling  are  modeled  separately  in  tire  package  factor,  these 
factors  are  derived  only  from  high  temperature  accelerated  life  test  data. 

Each  observed  failure  was  categorized  into  ate  of  the  following  failure  classes;  oxide,  metal, 
contamination  and  miscellaneous. 


The  following  failure  mechanisms/modes  listed  under  each  class  summarize  the  observed 
failure  cause  (from  life  tests)  contained  in  the  database  for  each  of  these  classes. 

Qx 

Functional  failure,  threshold  voltage  shift 
Input  leakage  failure,  threshold  voltage  shift 
Functional  failure,  gate  oxide  step  defect 
Access  time  out  of  spec,  charge  loss,  oxide  defect 
Access  time  out  of  spec,  charge  loss,  wearout 
Nominal  march,  oxide  damage 
Nominal  march,  oxide  damage 
Page  mode  failure,  oxide  damage 

Metal 


Metal  masking  defect 

Functional  failure  from  aluminum  corrosion 
Functional  Mure  from  metal  contact  defect 
Failure  parameter  from  aluminum  corrosion 
Op<m  metal  trace 

Open  metal  trace  from  aluminum  corrosion 
Functional  metal  failure  from  pattern  shifting 

Comunafom 

Parameter  Mure  from  contamination 
Functional  failure  from  contamination 
Ionic  leakage,  bake  recoverable 
Ionic  contamination 

Bits  Mure,  ionic  contamination  from  assembly  debris 


Functional  failures  (unknown  mechanism) 

Pattern  shifting 

AC/functional 

Masking  defect 

Column  short  to  VSS 

Parametric  failure 

Diffusion  mask  defect 

Output  leakage 

Input  leakage 

Functional,  trace,  degradation 
Input  transistor  short 
Junction  short 

Pin  leakage,  bake  recoverable 
Open 

Retention  failure 

Nominal  march 

Nominal  march,  poly  defect 

Nominal  march,  marginal  room  temp.  AC 

ICC  stand  by,  out  of  spec,  low  resistor  value 

Wire  bond  Mure 


The  methodology  used  to  develop  early  life  failure  Kites  are  as  follows: 

(1)  'The  tunes  of  failure,  for  each  Mure  mechanism  werecxtrectc^ 

(2)  Tft&is  times  were  converted  to  an  equivalent  25°C  time  based  m  the  temperature 
acceleration  factor  for  each  particular  Mute  observance. 

(3)  An  equivalent  total  number  of  part  hours  (at  25°C)  for  the  entire  database  was  extracted 


(4)  A  regression  analysis  was  then  performed  on  the  failure  rates  calculated  as  a  function  of 
time,  and  fit  to  the  following  model: 


X(t)  =  X{j  e“tx 
or  In  X(t)  =  In  -  tt 

(5)  Values  of  X^  and  x  were  then  determined. 

Figure  3-5  illustrates  a  hypothetical  application  of  this  methodology.  For  this  example,  die 
following  failure  rates  and  time  intervals  were  observed. 


Time  (10°  his.) 

X 

0-  .04 

.04  -  1.67 

,015 

.0028 
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A  simple  regression  solution  was  obtained  using  the  mid  point  values  of  time  in  each  interval, 
and  by  taking  the  natural  logarithm  transformation  of  the  failure  rate  value.  The  following 
relationship  was  then  obtained: 


X(t)  =  .011  e*2*2 1 


The  next  step  was  to  insure  that  the  cumulative  failure  rate  (integral)  was  the  same  for  the 
predicted  and  observed.  This  was  accomplished  by  adjusting  the  x  value  if  required  to  meet  this 
condition.  The  X^  value  was  kept  constant  u*^use  there  was  a  relatively  high  confidence  in  it 

since  there  was  typically  a  large  number  of  part  hours  and  failures  observed  in  that  interval,  thus 
yielding  an  accurate  failure  rate  in  the  early  time  interval.  Section  5,1.1  discusses  the  derivation  of 
the  actual  oxide  failure  rate. 

The  screening  test  effectiveness  has  not  explicitly  been  accounted  for  in  the  model  since 
empirical  test  results  were  used  to  develop  the  early  life  failure  rates  (which  are  related  to  test 
effectiveness)  based  on  screening  and  life  test  results.  However,  screening  effectiveness  was 
implicitly  included  in  the  model.  It  is  logical  to  assume  therefore  that  tire  test  effectiveness  for  a 
particular  test  is  approximately  constant  tltroughout  sire  industry  at  the  present  state  of  die  art  and 
inherent  in  the  model.  It  was  attempted  to  derive  test  effectivenesses  for  various  screens.  This 
was  abandoned  due  to  the  empirical  approach  taken  and  the  fact  that  accurately  deriving  test 
effectivenesses  is  extremely  difficult,  and  in  many  cases  impossible, 

3. l.i.i  IciiaigiaMgiteisiBUQfl  Fasha 

Since  the  early  life  failure  rate  is  intended  to  modal  defect  related  failure  mechanisms,  a  given 
constant  percentage  of  the  population  is  expected  to  fail  in  a  certain  time  period  under  a  certain  set 
of  circumstances,  l-or  example,  if  the  temperature  is  raised  by  a  level  consistent  with  an 
acceleration  factor  of  10,  tire  same  percentage  of  parts  should  fail  in  1/J0th  tire  time  of  tire  original 
temperature.  Thetis: 


t  at 

J  X(t)  dt  (25°Q  *  j  X(t)  dt  (at  T) 


( A-p  acceleration  due  to  tcmperamreT) 
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and 


X(t)  =  A,jjATe-tATt 

where  Xj,  =  base  failure  rate 


3.1. 1.2  Etesi?lQutX-ClS.l£ 

The  defect  related  mechanisms  being  modeled  in  the  early  life  are  those  that  occurred  during 
operating  life  tests  and  therefore  will  typically  only  be  accelerated  to  failure  when  the  device  is  in 
operation.  Since  the  models  presented  in  this  report  are  for  operating  conditions,  they  do  not 
include  duty  cycle  as  an  input  If  the  effect  of  duty  cycle  needs  to  be  accounted  for,  it  can  be 
modeled  in  the  same  manner  as  temperature  as  follows: 

X(t)  a  (DC)  Xfc  e*T  1  (a  =  Proportional  to) 

(where  (DC) = Duty  Cycle) 

(OjSDCSI) 

In  this  case  the  duty  cycle  is  defmed  as  the  percentage  of  time  tire  device  is  in  its  normal 
operating  state.  The  one  factor  that  does  require  duty  cycle  is  the  failure  rate  of  plastic  package 
types  in  which  case  an  effective  relative  humidity  has  to  be  calculated  as  a  function  of  duty  cycle. 

Therefore,  summarizing  the  effect  of  both  temperature  and  duty  cycle  yields: 

X(t)»XbAT(DC)e*i:]DCATt 
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3. 1.1.3  Effects  of  Screening  Time 


If  a  device  is  burned  in  or  subjected  to  a  high  temperature  operating  screen,  one  would  expect 
a  certain  percentage  of  defective  parts  to  fail.  In  this  model,  this  effect  is  accounted  for  simply  by 
adding  an  effective  screening  time  (to)  to  the  time  variable  (t).  An  example  showing  the  effects  of 

screening  time  is  presented  in  Figure  3-6. 


tQ  »  Effective  Screening  time 
«  (Actual  Screening  time)  A*j* 


X(t)  -  Ah  tf*  W  AT  (DC)  e<DC>  AT  ‘ 


Note  that  the  Aj  may  be  different  for  screening  and  use  environments.  It  should  also  be 

noted  here  that  only  life  test  results  were  used  for  development  of  these  failure  rates,  and  all 
environmental  test  results  were  used  only  for  the  package  factor.  There  were,  however,  a  few  time 
dependent  package  related  mechanisms  occurring  during  life  tests,  and  these  are  accounted  for  in 
the  miscellaneous  failure  rate. 

Therefore,  using  the  method  outlined  herein,  Table  3-4  summarizes  the  parameters  derived 
for  Xb,  t,  and  activation  energy  (Ea).  Ea  is  to  be  used  for  At  for  each  failure  mechanism. 


TABLE  3-4: 

EARLY  LIFE  FAILURE  RATE  PARAMETERS 


Failure  Mechanism 

Xb  (F/106  hrs.) 

EBzSi 

m 

Metal 

.00102 

1.18 

Oxide 

.0788 

7.70 

Contamination 

.000022 

.0028 

1.0 

Miscellaneous 

,010 

2.2 

.43. 

The  derivation  of  each  of  these  failure  rates  is  given  in  subsequent  sections  of  this  report 
which  discuss  each  failure  mechanism  separately. 

Since  approximately  58%  of  all  failures  were  of  unknown  failure  mechanisms  (no  failure 
analysis  or  inconclusive  failure  analysis)  the  parameters  in  Table  3-4  were  derived  by  assuming 
that  the  unknown  failure  medtanisms  had  the  stums  relative  percentages  between  mechanisms  as  the 
known  failure  mechanism  distribution.  In  this  manner  all  failures  were  accounted  for. 

3.1.2  Madding 

All  wearout  failure  mechanisms  in  this  model  have  been  modeled  with  a  lognormal  time  to 
failure  distribution.  All  wearout  failure  mechanisms  modeled  in  this  effort  have  empirically  been 
shown  by  many  researchers  to  follow  the  lognormal  distribution.  Titis  distribution  is  given  by: 
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1 


exp 


f(t)  = 


(In  t  -  In  tso)2 

- U - 


where  t  is  time,  is  the  time  at  which  50%  of  the  population  fails,  and  c  is  the  standard 
deviation. 

Since  the  prediction  model  is  in  the  form  of  a  hazard  (h(t))  or  failure  rate,  the  hazard  rate  of 
the  lognormal  must  be  used.  This  is  given  by: 


h(t)  = 


M. 

R(t) 


Since  R(t).,  the  reliability  function,  involves  an  integral  and  becomes  complex  for  the 
lognormal  distribution,  the  hazard  rate  cannot  be  obtained  in  a  closed  form  solution  over  all  times. 


Various  statistical  simulations  were  performed  to  determine  if  the  lognormal  distribution 
could  be  approximated  with  other  simpler  distributions.  A  pdf  (probability  density  function)  was 
obtained  for  the  extreme  value  of  the  lognormal  and  this  truncated  distribution  was  then  subjected 
to  the  Kolmogorov-Smimov  (or  K-S)  test  to  determine  which,  if  any  distribution  fitted  this  data 
set.  A  constant  failure  rate  (exponential  pdf),  Poisson,  or  normal  distributions  clearly  did  not  fit 
the  distribution. 

If  however,  the  reliability  is  relatively  high  (i.e.,  greater  than  .8),  the  probability  density 
function  itself  represents  a  good  approximation  to  the  hazard  rate,  so  that  if  the  reliability  is  greater 
than  .8,  there  is  no  more  than  20%  error  in  this  approximation.  The  point  at  which  the  reliability  is 
.8  for  any  given  mechanism  signals  that  the  device  population  is  reaching  end  of  life,  and  the 
hazard  rate  will  be  dramatically  increasing.  Therefore,  by  defining  the  model  to  be  valid  only  for 
those  times  where  the  reliability  is  greater  than  .8,  the  closed  form  probability  density  function  can 
be  used  to  approximate  the  hazard  rate.  Beyond  this  time,  the  model  is  not  valid. 

Since  there  are  several  variables  affecting  the  failure  rate  of  the  wearout  mechanisms,  the  end 
of  life  time  has  been  defined  to  be  that  in  which  the  time  is  equal  to  .5  t^Q  or  when  the  failure  rate 

for  a  single  mechanism  has  reached  .1  F/IO^  hrs.,  whichever  is  less.  The  failure  rate  predictions 
are  therefore  invalid  beyond  these  times. 
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The  Figures  3-7  through  3-12  illustrate  the  lognormal  distribution’s  probability  density 
function,  cumulative  distribution  function,  and  hazard  function  for  several  combinations  of  means 
(t5o)  and  standard  deviations  (a).  The  mean  and  sigma  are  given  in  the  upper  right  hand  comer  of 

each  graph. 

One  area  of  concern  in  the  use  of  the  lognormal  distribution  is  its  high  sensitivity  to  variations 
in  sigma.  In  all  distributions  defined  thus  far,  a  sigma  of  1.0  is  typical.  Although  it  is  evident  that 
this  value  is  fairly  well  accepted,  slight  deviations  from  it  can  significantly  affect  the  model.  To 
illustrate  this.  Table  3-5  shows  the  results  of  a  few  failure  rate  calculations  at  10  years.  This 
dependency  can  also  be  seen  in  Figure  3-12  which  illustrates  the  hazard  rate  at  the  extreme  value  of 
a  lognormal  distribution  with  a  mean  life  of  10^  and  a  sigma  which  varies  from  .7  to  1.3. 

TABLE  3-5: 

HAZARD  RATE  AS  A  FUNCTION  OF  MEAN  AND  SIGMA 


*50 

.5 

.9 

Sigma 

1.0 

i.i 

1.5 

106 

6.4xl0"n 

1.3xl0"7 

2.3xl07 

3.6xl0"7 

8.1xlO”7 

5xl06 

2.1xl0"10 

8.0xl0"8 

10? 

Wm^SSmm 

4,9x10"* 2 

■Haul 

3.9xlO"10 

2.1xl0“8 

From  these  numbers  it  can  be  seen  that  using  a  sigma  of  .9  or  1,1  for  a  projected  tjo  of  10” 
hours  can  mean  the  difference  from  130  to  360  RTS  (i.e„  Failures/10^  hours).  For  longer  tgQ  the 

relative  change  may  be  1  or  2  orders  of  magnitude  although  the  actual  failure  rate  values  are  much 
smaller.  Large  uncertainties  in  the  sigma  resuit  in  very  large  uncertainties  of  die  failure  rate. 

While  a  sigma  in  the  range  of  1.0  is  reasonable  and  consistent  with  theory,  the  range  of 
sigma  reported  in  published  data  vary  widely.  The  ideal  model  would  measure  how  aggressive  a 
manufacturer’s  design  rules  and  process  controls  were.  For  example,  if  every  metal  snipe  carried 
tits  maximum  current  density  and  the  process  was  marginal  or  had  wide  variations  (i.e.,  step 
coverage  varied  from  10%  to  60%  with  design  roles  specifying  30%),  then  there  would  be  a 
considerable  clcctromigration  risk.  Another  design  may  have  just  a  few  stripes  where  the  current 
density  is  maximum  and  never  experience  clcctromigration.  While  it  may  be  possible  to  develop 
design  analysis  software  tools  which  calculate  current  densities  for  every  line  and  then  sum  them  in 
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FIGURE  3-7:  PROBABILITY  DENSITY  FUNCTION  LOGNOl 
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FIGURE  3-8:  CUMULATIVE  DISTRIBUTION  FUNCTION  LOGNO! 


FIGURE  3-9:  HAZARD  FUNCTION  LOGNORMAL 


FIGURE  3-11:  CUMULATIVE  DISTRIB' 
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FIGURE  3-12:  HAZARD  FUNCTION  LOGNORMAL 


some  tnaaner,  it  is  not  likely  that  such  information  will  ever  be  released  publicly  or  to  the  DoD. 
Some  manufacturers  may  also  know  that  their  step  coverage  and  line  width  vary  significantly,  but 
such  information  is  not  likely  to  be  released.  Both  of  these  factors  make  significant  contributions 
to  the  sigma  and  make  it  very  difficult  to  include  in  a  general  wearout  prediction  model.  By  using 
the  default  sigmas  in  the  model,  a  typical  failure  rate  for  each  mechanism  can  be  obtained,  with  the 
understanding  that  there  are  wide  variations. 

The  assumption  in  choosing  an  average  sigma  is  that  the  random  variations  in  the  process  and 
the  statistical  variations  in  measurement  are  accounted  for  in  the  sigma  of  the  lognormal 
distributions  (applicable  for  the  wearout  mechanisms  oily),  since  the  distribution,  mean  and  sigma 
were  based  on  empirical  data. 

There  was  no  data  available  which  contradicts  the  use  of  the  lognormal  distribution,  even  at 
the  extreme  low  end  of  the  distribution  which  tends  to  be  where  its  accuracy  decreases.  However, 
this  is  not  a  significant  effect  since  the  failure  rate  contribution  of  the  lognormal  distribution  only 
becomes  significant  when  the  device  is  approaching  its  end  of  useful  life,  or  its  wearout  period. 
Therefore,  the  wearout  relationships  will  only  provide  an  estimate  for  the  end  of  life  and  very  little 
information  about  the  failure  rate  during  the  useful  life  of  die  device. 

3.2  RELATIONSHIP  TO  THE  GENERIC  QUALIFICATION  PROGRAM 

This  reliability  prediction  modeling  effort,  since  it  is  for  state  of  tire  art  CMOS  technology, 
should  be  coordinated  with  and  related  to  other  VHSIC/VHSIC-Like  technology  efforts  such  as 


Efforts  were  made  to  utilize  data  resulting  from  the  Yield  Enhancement  and  Generic 
Qualification  efforts  as  input  to  the  detailed  model.  Key  dements  of  these  programs  that  were  of 
interest  are  the  mat  structure  testing  performed  on  a  regular  basis.  There  are  three  elements  to  this 
testing: 

(1)  Parametric  Monitor  (PM)  testing  of  individual  transistor's,  diodes,  and  capacitors  on 

every  wafer.  Undoubtedly  there  could  also  be  a  transistor  with  which  to  measure 
substrate  current  for  the  hot  carriers  t  §q  equation. 
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(2)  Technology  Characterization  Vehicle  (TCV)  is  used  for  periodic  time  dependent  failure 
mechanisms  for  the  same  mechanisms  used  in  prediction  models.  Thus,  companies 
involved  in  Generic  Qualification  would  have  acceleration  factors  that  might  be  used  in 
their  own  models.  It  is  also  expected  that  fast  wafer  level  tests  will  be  developed  and 
performed  on  every  wafer  run  for  electromigration,  TDDB,  and  hot  carriers. 

(3)  Standard  Evaluation  Circuits  (SEC)  are  processed  on  a  regular  basis  and  regularly 
subjected  to  life  tests.  Perhaps,  these  could  serve  as  the  reference  chip  in  the 
electromigration  and  TDDB  equations. 

In  addition,  the  Generic  Qualification  program  instills  higher  quality  throughout  the 
manufacturing  process  from  design  to  assembly  and  shipment.  Therefore,  the  short  form  model  to 
be  presented  later  has  a  quality  factor  associated  with  a  manufacturer  who  can  demonstrate  this 
level  of  quality. 

After  investigating  the  potential  use  of  these  data  sources,  it  was  concluded  that  designing  the 
model  to  include  specific  data  front  these  sources  would  be  very  difficult,  if  tot  impossible  due  to 
the  variability  between  manufacturers  in  die  specific  manners  in  which  they  obtain  the  data.  For 
example,  the  SEC  circuit  provides  much  information  useful  for  reliability  studies  but  is  not 
standardized  between  manufacturers.  Therefore,  it  was  concluded  that  the  level  of  standardization 
necessary  for  input  to  a  reliability  model  has  not  yet  been  achieved.  In  fact,  in  many  cases  it  would 
be  undesirable  to  have  a  very  standardized  method  of  determining  reliability  characteristics.  This  is 
due  to  the  fact  that  there  are  many  variables  in  the  design  and  fabrication  of  a  circuit  and  die  best 
manner  to  test  for  their  reliability  characteristics  is  not  always  the  same  and  may  be  unique  for  a 
particular  process. 

Since  it  is  very  important  to  make  use  of  the  data  from  these  efforts,  1ITRI  and  Honeywell 
believe  that  an  effective  manner  in  which  to  accomplish  this  is  to  assign  a  base  failure  rate  (for 
elceuomigration,  time  dependent  dielectric  breakdown,  and  hot  earner  effects)  as  a  function  of  a 
manufacturers  ability  to  prove  they  have  the  failure  mechanisms  under  control. 


This  approach  has  merit  because  well  designed  circuits  typically  exhibit  very  low  failure  rates 
for  these  mechanisms  when  used  in  typical  applications.  In  fact,  there  has  been  very  few  instances 
of  field  failures  reported  due  to  these  mechanisms.  The  database  developed  for  this  modeling 
effort  supports  this  view  since  it  also  contains  very  few  failures  due  to  these  mechanisms.  These 

low  failure  rates  are  then  reflected  in  the  quality  factor  for  those  manufacturers  on  the  QML 
(Qualified  Manufacturers  List)  for  the  short  form  model  and  in  the  t^g  expressions  for  the  wearout 

mechanisms  in  the  detailed  model.  This  approach  is  consistent  with  the  generic  qualification  effort 
which  allows  some  flexibility  in  the  methods  used  by  the  manufacturer  in  proving  the  adequacy  of 
a  design. 

The  detailed  model  accounts  for  the  manufacturing  process  capabilities  primarily  in  the  defect 
density  factor.  Presumably,  a  manufacturer  with  good  process  controls  such  as  those  on  a  QML 
or  QPL  (Qualified  Products  List)  will  have  a  lower  defect  density  than  one  without  those  controls 
in  place.  For  this  reason  the  differences  between  the  QML  or  QPL  status  of  a  manufacturing 
process  tumid  inherently  be  accounted  for  in  the  defect  density. 

The  short  form  model,  however,  cannot  use  defect  density  as  an  input  since  most  users  of  tire 
short  form  mod*!  will  rarely  have  access  to  specific  values.  Therefore,  a  default  defect  density  was 
derived.  Additionally,  the  short  form  model  includes  a  factor  based  on  the  QML  or  QPL  status  of 
die  manufacturing  process.  Tins  factor,  discussed  further  in  Section  7.1.4,  was  based  on  the 
observed  variation  in  reliability  between  manufacturers,  presuming  that  these  differences  are 
accounted  for  in  the  QML  process. 

Guidelines  for  one  method  of  calculating  the  detect  density  for  both  oxide  and  metal  are  given 
in  Appendix  B.  These  are  calculated  separately  indicating  that  if  only  the  oxide  defect  density  is 
known,  it  can  be  used  and  the  default  condition  for  metal  defect  density  can  be  used 
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4.0  DATA  COLLECTION  AND  DATABASE  DEVELOPED  FOR  THIS  PROGRAM 


4.1  L-  iTABASE 

A  database  was  developed  to  hold  and  analyze  the  empirical  data  collected  in  this  effort.  This 
database  has  been  developed  for  an  IBM  PC  and  contains  provisions  for  entering  the  following 
information  about  the  device  itself,  the  stresses  it  was  exposed  to,  and  the  results  of  those  stresses: 


Device 

Stress 

Part  Number 

Power  Dissipation 

Manufacturer 

Test  Type 

Part  Description 

Junction  Temperature 

Package  Type 

Current  Density 

Number  of  Pins 

Oxide  Voltage 

Thermal  Resistance 

Test  Conditions 

Technology 

Gate  Count 

Ambient  Temperature 

Feature  Size 

Die  Dimensions 

IgSLtolM 

Metal  Length 

Number  Te,sted 

Metal  Width 

Number  Failed 

Number  of  Metal  Layers 

Time  to  Failure 

Metal  Material 

Failure  Mechanism 

Gate  Length 

Oxide  Thickness 

Oxide  Area 

Metal  Defect  Density 

Oxide  Defect  Density 

Yield 

ESP  Susceptibility 

Failure  Mode 

The  type  of  test  information  collected  was  primarily  operating  life  test,  bum  in,  and  various 
environmental  tests.  The  failure  causes,  feature  size,  and  die  area  of  a  profile  of  this  database  is 
given  in  tire  following  sections.  The  detailed  data  contained  in  this  database  is  given  in  Appendix 
D. 
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4.1.1 


Since  the  objective  of  this  modeling  effort  was  to  quantify  the  failure  rate  of  each  failure 
mechanism,  knowing  the  cause  of  each  observed  failure  was  very  important  The  cause  of  failure 
was  known  for  approximately  half  of  all  failures. 

Table  4-1  illustrates  the  number  of  observed  failures  as  a  function  of  tire  failure  cause  (failure 
class)  and  test  type.  Each  observed  failure  that  had  a  reported  associated  failure  cause  (failure 
mode,  mechanism,  or  cause)  was  classified  into  one  of  the  failure  classes  listed  in  this  table,  thus 
facilitating  Mute  rate  modeling  of  each  failure  class. 

One  data  source  contributed  a  large  number  of  failures  for  which  die  failure  class  was  rot 
entirely  defined.  Therefore,  Table  4-2  summarizes  the  same  information  as  Table  4-1  but  excludes 
that  data  source.  As  explained  in  Section  3.0,  the  unknown  data  was  assumed  to  have  the  same 
relative  weightings  as  the  known  failures  when  performing  the  actual  modeling.  Tins  modeling 
was  accomplished  by  increasing  the  predicted  Mure  rate  of  each  of  die  known  mechanisms  by  a 
given  percentage.  In  this  manner  all  failures  m  accounted  for. 


TABLE  4-1; 

TOTAL  NUMBER  OF  FAILURES  OBSERVED  AS  A  FUNCTION 
OF  TESTTYHj  AND  FAIL  CLASS 
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Tempctanite 
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Bfivhtmmenm! 
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•  - 
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54 

4 
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21 
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21 
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24 
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0 
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6 

71 
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44 
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5 

53 
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7 
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5 

0 
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J1 
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• 
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15 
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TABLE  4-2: 

NUMBER  OF  FAILURES  OBSERVED  AS  A  FUNCTION 
OF  TEST  TYPE  AND  FAIL  CLASS  EXCLUDING  ONE 
DATA SOURCE 


Failure 

Class 

Test 

■EBM 

Environmenta1 

Total 

Unknown 

437 

54 

4 

495 

Assembly 

21 

0 

0 

21 

Package 

22 

214 

0 

236 

Metal 

62 

3 

6 

71 

Misc. 

44 

4 

5 

53 

Oxide 

111 

7 

0 

118 

Contamination 

4$ 

5 

0 

53 

Wearout 

_1 

JQ 

7 

Total 

752 

207 

15 

1054 

4.1,2  BaimSizg-Emfik 

The  distribution  of  feature  sizes  present  in  the  database  (in  those  coses  where  it  was  known) 
is  given  in  Figure  4-1.  Although  devices  with  feature  sizes  in  the  3-5  micron  area  were  beyond  the 
scope  of  this  study,  they  were  included  to  make  provisions  in  die  database  to  identify  trends  as  a 
function  of  feature  size.  A  weighted  (by  number  of  tested  devices)  average  of  feature  sizes  in  this 
databaseislOpm. 
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FIGURE  4-1: 

FEATURE  SIZE  PROFILE 


4.1.3  Die  Area  Profile 

The  distribution  of  die  areas  present  in  the  database  is  given  in  Figure  4-2.  The  weighted  die 
area  by  number  of  devices  is  .21  cm^. 
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(1  cm  2  =  155,000  mil  2  ) 


FIGURE  4-2: 
DIE  AREA  PROFILE 
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5.0  FAILURE  MECHANISM  CHARACTERIZATION 


5.1  OXIDE  FAILURE  RATE 

5.1.1  Oxide  Early  Life  Failures 

The  early  life  failure  rate  for  oxide  related  failure  mechanisms  was  developed  using  the 
methodology  outlined  in  Section  3.0.  Table  5-1  summarizes  the  data  used  in  development  of  this 
factor.  In  this  table,  the  accelerated  time  interval  is  die  effective  time  interval  (in  10^  hrs.)  at  25°C 
(using  the  oxide  temperature  acceleration  factor  with  an  activation  energy  of  .3  eV).  The 
accelerated  part  hours  are  the  total  effective  observed  oxide  part  horns  at  25°C  in  each  time  interval, 
and  the  last  column  is  the  number  of  observed  oxide  failures  occurring  in  each  interval. 

TABLE  5-1: 

OXIDE  FAILURE  DATA 


Equivalent  Time 

Interval  at  25°C(106  hrs.) 

Accelerated 

Part  Hours  (10^  hrs.) 

Number 
of  Failures 

0-0.002344 

644.832020 

3 

0.002345-0.008204 

1601.110137 

13 

0.008205*0.01 6950 

1648.146367 

1 

0.01695 1-0.0244  It 

1372,043816 

19 

0.024418-0.048834 

4309.957812 

229 

0.048835-0,049224 

6.008332 

2 

0.049225-0.050446 

15.044018 

1 

0,050447-0.138452 

524.665804 

2 

0,1 38453-0.193123 

35,612394 

3 

Selecting  two  discrete  ranges  for  die  regression  solutions  yields  the  following  data  in  Table 
54,  These  ranges  were  defined  based  on  die  quantity  of  data.  For  example,  there  was  a  large 
quantity  of  data  from  devices  that  were  tested  at  an  equivalent  time  of  .048834,  therefore  0- 
.048834  was  one  of  the  ranges  chosen.  It  should  be  noted  Here  thatmlatively  large  ranges  of  dine 
had  to  be  grouped  into  each  of  the  two  ranges  because  in  many  cases  sources  reported  failures 
occurring  within  a  time  interval  For  example,  if  the  devices  on  test  v re  tested  only  at  500  hours, 
the  failures  observed  actuary  occurred  prior  to  500  hrs*  although  the  exact  time  is  not  known. 
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TABLE  5-2: 

OXIDE  EARLY  LIFE  SUMMARIZED  FAILURE  RATE 


Effective 

Effective 

Number  of 

Failure 

Range 

BlwM 

Part  hrs.  (10*>) 

Failures 

Rate  (F/106  hrs.) 

0-.O48834 

.024417 

9576 

265 

.0277 

.048835-.  193 123 

.12098 

582 

8 

.0137 

Regressing  on  this  data  and  adjusting  the  time  constant  to  insure  that  the  cumulative  failure 
rates  are  equivalent  for  die  predicted  and  observed  cases  yields: 

(t)  =  .0331  e’7*7 1 

Since  failures  of  unknown  causes  represented  58%  of  the  data,  the  failure  rate  derived  here 
should  be  multiplied  by  2.38  to  compensate  for  the  unknown  failures.  This  assumes  that  the 
unknown  failures  have  die  same  relative  rate  of  occurrence  between  mechanisms  as  do  the  known 
failures.  Since  the  cumulative  failure  rata  is  direcdy  proportional  to  the  constant  (iii  this  case 
.0331),  this  constant  should  be  multiplied  by  2.38,  yielding  the  Mowing  hazard  rate: 

early  life  *  ^788  e  7,7 1 

Combining  the  effects  of  tarqrcraiure  and  screening  time  yields; 

early  life  “  -0788  «'7'7  ATox ' 


There  are  several  factors  that  already  affect  the  early  life  oxide  failure  rate  that  are  not  present 
in  this  expression.  One  such  factor  is  the  oxide  electric  field.  To  be  able  to  include  this  factor  the 
electric  field  would  need  to  be  known  for  all  devices  in  the  database,  since  it  is  an  empirical 
relationship.  Unfortunately,  the  electric  field  was  rarely  reported,  making  it  impossible  to  include 
this  effect  This  early  life  oxide  model  therefore,  as  with  the  other  early  life  failure  rates, 
represents  an  industry  wide  average  observed  hazard  rate. 

The  oxide  wearout  hazard  rate  however,  has  included  the  effects  from  factors  such  as  electric 
field  since  it  is  a  more  theoretically  based  factor. 

5.1.2  Qsi&JfeM 

Time  dependent  dielectric  breakdown  is  considered  an  oxide  wearout  mechanism  and  has 
empirically  been  shown  to  follow  the  lognormal  time  to  failure  distribution.  Since  a  lognormal 
distribution  is  uniquely  defined  with  a  mean  (159)  and  standard  deviation  (cr),  the  intent  of  the 

effort  to  quantify  the  oxide  wearout  failure  rate  was  to  quantify  the  t5Q  and  cr  as  a  function  of 
applicable  stress  and  fabrication  variables. 

Although  many  researchers  have  studied  time  dependent  dielectric  breakdown  thoroughly, 

the  basis  physics  of  failure  is  still  not  completely  understood.  This  fact  makes  it  difficult  to  derive 
a  general  t§0  model  valid  fos*  all  manufacturing  processes  in  all  situations.  As  explained  in  Section 

3.0  of  this  report,  however,  the  wearout  mechanisms  only  provides  an  indication  of  an 

approximate  end  of  life  time  and  contribute  very  little  to  the  failure  rate  in  the  useful  life  of  Urn 
device.  Therefore,  if  a  manufacturer  can  prove  that  the  tgg  for  oxide  wearout  for  his  particular 

process  is  much  greater  than  the  life  expectancy  of  a  part,  this  factor  can  essentially  be  ignored  in 


For  these  reasons,  tife  failure  rate  for  oxide  wearout  <TDDS)  was  derived  as  a  function  of 
ismpmum,  electric  field  in  the  oxide,  oxide  area  and  detect  density.  These  relationships  were 
derived  from  empirical  data  feom  various  researchers  on  oxide  reliability.  The  following  sections 

'  {fop  pf  jjhfis#  fecimre. ' 


5. 1.2.1  Temperature  Acceleration  Factor 


The  basic  temperature  acceleration  follows  the  Arrhenius  relationship  of  the  following  form, 

where  A-p^  =  Temperature  Acceleration  Factor  for  Oxide 

T  =  Temperature  (in  degrees  Kelvin) 

Tq  »  Reference  Temperature  =  298°K 

K  *  Boltzman's  Constant  =  8.63  a  10*^  (ev/°K) 

Tj  »  Average  Junction  Temperature  (°C) 

Ea  =  Activation  Energy  (eV) 

Based  on  test  results  contained  in  the  literature,  the  commonly  accepted  activation  energy  is 
.3  eV,  which  has  also  been  experienced  by  Honeywell,  and  therefore  is  the  nominal  value  to  be 
used  in  this  model  The  temperature  accdsratiaa  factor  for  TDDB  is  therefore  the  following; 


5. 1.2.2  Electric  loeld Aceetemtiott  Factor 


Crook  (Reference  27)  had  originally  derived  a  form  for  the  acceleration  factor  due  to  tire 
decide  stress  applied  across  an  oxide.  This  tonn  was: 


AgpOicsp 


'EREf?  -  EsI 

sr~l 


*■*»-.  Rrdeitmee Electric Held 3MV/cra 
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Domangue  (Reference  31)  has  conducted  tests  at  various  electric  field  stresses  (3,  4,  and  5 
MV/cm)  from  which  an  acceleration  factor  fan  be  obtained.  Analysis  of  his  data  yields  an  Eg  = 

.1324  MV/cm  and  therefore  an  acceleration  factor  can  be  stated  as  follows: 

*  r3  •  Esi 

AEF=exP[T53rj 

More  recent  work  by  Hu  (Reference  39)  on  oxides  closer  to  those  types  and  thicknesses  used 
in  VHSIC  circuits  has  shown  that  the  time  to  breakdown  is  related  to  the  electric  fiela  in  the 
following  manner: 

B+H 
tBD  a  e  SojT 

where  B  «  240  MV/cm 
H  »  80MV/cm 


It  can  be  seen  that  this  acceleration  factor  has  enormous  changes  for  reasonable  changes  in 

0  0 

Eox.  Bor  example,  a  5  volt  part  that  has  300  A  and  100  A  oxides  (Eox  of  1,67  and  5  MV/cm 
respectively)  would  have  acederation  factors  differing  by  56  orders  of  magnitude. 

Hu  had  used  a  smaller  number,  192,  instead  of  320  for  comparing  *he  1  %  cumulative  failure 

points  as  opposed  to  tire  50%  cumulative  failure  point,  arguing  thut  the  value  should  be  smaller  for 

defect  related  breakdown  dm  for  near  intrinsic  breakdown.  Since  is  this  mode!,  the  1% 

cumulative  failure  point  is  of  much  mote  interest  than  the  50%  point,  the  constant  of  192  will  be 
used  in  this  factor.  Therefore,  using  an  exponent  of  192  and  choosing  the  Ay  term  to  agree 


In  addition  to  the  electric  field,  various  researchers  hiive  also  observed  a  relationship  between 
the  acceleration  factor  and  oxide  thickness.  Derivation  of  this  factor  (from  Reference  26)  based  on 
empirical  studies  yields: 

Acceleration  a  4.2  log  Tox  -  6.95 

Therefore,  the  factor  that  could  be  used  in  the  oxide  model  is  therefore: 


where  ^  oxR  “  Reference  oxide  thickness 

»  Actual  oxide  thickness 

Since  the  variation  in  this  factor  is  very  small  compared  to  the  electric  field  acceleration 
factory  it  was  not  included  in  the  model  since  the  uncertainties  in  the  electric  field  factor  are  much 
larger  than  the  oxide  thickness  factor  itself. 


$.1.2*3  Dio  Area  and  Delect  Density 


This  section  discusses  the  relationship  between  die  mm:,  defect  density,  and  yield.  'Although 
it  Ifciapludcd  in  this  section,  the  discussion  of  yield  effects  is.  not  specific  to  oxide,  hr  this  model, 
there  ate  separate  delect  densities  for  oxide  and  metal,  aiihough  they  use  the  same  default  value, 
which  isafuaaioa  of  feature  sice. 


Phillips  (Reference  28)  has  suggested  the  use  of  extreme  value  statistics  to  derive  the 
area/defect  density  relationship  of  gate  oxides.  In  extreme  value  statistics,  the  hazard  rate  can  be 
closely  approximated  by  the  exponential  function.  Thus, 

h(x)=  vm 


d  F(x) 

dx 


TABLE  5*3: 

ELECTRIC  FIELD  ACCELERATION  FACTOR  (A^) 
AND  ACTIVATION  ENERGY  (E*)  IN  LITERATURE 


Reference 

Material 

?ox<A) 

Temp.  (C) 

AEF(1/ (MV/CM)) 

Ea(eV) 

29 

Poly 

>400 

25-160 

10? 

.3-25 

30 

A1 

450 

85-250 

101.5-k5 

2.1 

31 

Poly 

390 

25-150 

102.0 

.36 

32 

Poly 

400 

- 

10**? 

.26 

33 

Poly 

100-200 

• 

104 

1.0 

34 

Poly 

1CKMO0 

1056 

- 

35 

Poly 

100 

25-150 

101*87.102.0 

.3 

36 

Poly 

200 

20-250 

102*5 

.3 

37 

my 

60-100 

170-250 

10L74.jo1,9 

1,0- U 

3S 

Poly 

no 

25-275 

10L41 

.23 

From  the  data  in  Reference  28  and  from  extreme  value  statistics,  for  a  gate  oxide  area  A}  with 
average  breakdown  E} 

h1(E)  =  exp(SN(E-E1)/Sv-YN) 
and  similarly  for  a  gate  oxide  area  A2  with  average  breakdown  E2 

1*2(2)  *  exp  (Sjvj(E“E2/Sy-Yjj) 

Thus; 

hi  (EVh2  OB)  =  exp  (Sn(E2-E!)/Sv) 

It  is  noted  the  SN  and  YN  are  constants  that  depend  only  on  die  sample  size  and  that  Sv  is  the 
standard  deviation  and  the  same  for  both  distributions  (Aj  and  A2)> 

It  is  also  clear  from  the  data  presented*  which  is  supported  by  die  extreme  value  st  ttisdc  tlieoiy , 
that 

InC-ht  (i-Fj))  -  to  tfn(i-Fg)  *  InCAj/Aj) 
awl 

•  Sf,  (E-E^Sy * ta(A'i/A2) 

•.  Thus;  . 

^(E^E^v^InCA^ 

■  and  . 

i  blCEl/foCE*  (I4A|/A2) «  A j/A2 


Thus,  the  ratio  of  the  hazard  rates  is  equal  to  the  ratio  of  the  gate  oxide  arm  in  extreme  value 
statistics. 

This  factor  indicates  that  if  reliability  data  is  available  on  a  circuit  such  as  a  SEC,  the  hazard  rate 
could  be  extrapolated  to  the  entire  chip  by  knowing  the  gate  oxide  areas.  Or,  if  the  average  arm  of 
the  devices  in  die  database  is  known,  it  can  be  used  as  the  reference  area. 

The  following  discussion  summarizes  another  analysis  that  was  conducted  to  determine  the 
relationship  between  both  area/defect  density  and  hazard  rate. 

If  f(t)  is  the  failure  density  function,  then  the  cumulative  failure  density  function  is  given  by: 

f 

Q(t)*/f(t>  dt 


Crook  (Reference  27)  used  another  expression  for  Q(t)  which  was  said  to  be  from  Li  and 
Maserjion  (Reference  41),  which  was  actually  obtained  front  Price  (Reference  42).  Price  derived 
the  following  expression  in  1970  from  Bose*Et«sU5in  statistics,  staring  that  earlier  models  using 
Boltzman  statistics  wens  inaccurate; 

Qg(t)«l/a+i/AsDs) 

A$  is  the  -area,  Ds  is  the  defect  density  and  Q§(t)  is  the  eutasdaUve  failure  density.  Price  derived 

•this  expression  for  integrated  circuit  yields.  On  the  assumption  that  the  effective  defects  are 
tindomly  distributed  along  two  dimensions  and  are  thus  iudisringuisitable,  ii  and  hiaserjtiut  found 

the  equation  applicable  far  the  dfffeedve  defect  density  for  rime  dependent  dielectric  breakdown. 
No&  dial  0  or  D$  is  a  ftmeiioa  of -time,  tn  rite  discussion  that  follows  the  subscript  s  'indicates  that 

these  values  are  for  the  snesa  test  structure  from  which  data  is  generated  in  the  laboratory. 
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With  the  above  formula  it  follows  that  for  a  second  chip  with  area  Aq  and  the  same  defect 
density  the  cumulative  failure  density  is  given  by: 

QO  (t) «  WHAJAq)  (1/Qs(t)-1)) 

In  a  similar  fashion  it  can  be  shown  that  for  a  third  chip  with  the  same  area  as  the  second  chip 
Aq,  but  a  different  defect  density  a  cumulative  failure  density  function  is  given  by: 

Qn  (t)  =»  l/(l-KDo/Dn)  (l/Qo(t)-l» 

If  the  last  two  equations  am  combined,  the  resulting  equation  becomes 

On  <D  =  1/a^As/D^Ao  (i/Qs  (t}-l)> 

or 

%  (t) «  WMPsWW*  (*/Qs  (0  -  D> 

From  this  expression  it  is  dear  that  as  the  defect  density  and/or  tire  gate  oxide  area  gets  large, 
the  cumulative  number  of  failures  gets  large. 

Crook  also  showed  that  ti»  hazard  rats  (the  probability  that  a  device  will  M  in  the  time  (tnU)  if 
U  has  already  survived  until  t)  is  found  by  differentiating  Q(0  mid  amounts  to 

MO -1/(1^  (t» 

Ffewi  the  above  discussion  a  failure  tate  for  time  dependant  dielectric  breakdo m,  of  a  given 
chip  can  bo  found  if  all  of  the  following  m :  known: 

(I)  the  sigma  and  tire  t^  of  the  TDDB  failures  of  a  test  structure  with  a  known  defect  density 
and  area; 

.  (2)  voltage  sna  thermal  acceleration  factors  if  the  test  structure  work  was  performed  at 


0)  tixs  areatiad  defect  density  of  the  given  chip. 
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Unfortunately,  the  calculations  involve  the  integral  of  a  complicated  function.  To  deal  with  this 
problem  numerical  integration  was  used,  to  assess  the  effect  of  changing  the  area  and  defect 
density.  It  immediately  became  clear  that  for  a  small  number  of  cumulative  failures,  the  formulas 

could  be  greatly  simplified.  From  the  last  cumulative  failure  density  function  expression  above,  if 
Qg  (t)  is  small,  then 


On  (t)  =  l/(DsAs/DnAn)  (1/Qs  (t)) 
=  DnAnQs®/DSAS 


Thus,  the  cumulative  failure  density  function  is  directly  proportional  to  the  defect  density  and 
the  area  of  the  chip.  Since  the  cumulative  failure  density  function  is  the  integral  of  the  failure 
density  function,  it  follows  that  the  failure  density  function  rs  directly  proportional  to  the  area  and 
defect  density.  For  the  first  few  failures  the  instantaneous  failure  rate  is  equal  to  the  failure  density 
function,  and  thus  it  is  directly  proportional  to  defect  density  and  area.  Thus,  in  the  first 
approximation  to  double  the  chip  area  (or  defect  density)  is  to  double  the  chip  failure  rate. 

In  the  previous  paragraphs  some  relationships  were  established  between  the  cumulative  failure 
probability  Qn»  and  the  number  of  defects,  D,  and  the  chip  area,  A.  These  relationships  were 

based  on  a  model  developed  by  Price  (Reference  42)  using  Rose-Einstein  statistics.  A  more 
general  defect  model  was  derived  by  Stapper  (Reference  43).  In  this  model  the  probability  defect 
density  function  is  related  to  the  gamma  distribution.  For  this  model  the  yield  is  given  by 

Y«  1/(1+ADq/S)^ 

where  A  is  the  chip  area,  Dq  tits  average  defect  density,  and  S  a  shape  parameter.  This  model 

assumes  that  the  defects  have  a  given  distribution  pattern  across  tits  wafer.  In  the  limiting  case 
where  S  approaches  0,  the  distribution  is  a  delta  function,  meaning  the  defect  (tensity  is  constant 
and  all  defects  ait'  randomly  distributed  and  independent  Tins  condition  leads  to  die  Poisson  yield 
estimate 


Y-e-1** 

Also  it  can  be  seen  that  when  die  S  of  the  Stapper  model  is  1,  die  yield  model  reduces  to 

Y«l/(l+ADfo) 
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which  was  described  previously  as  the  Price  model,  but  actually  used  earlier  by  Seeds  (Reference 
44).  In  this  case  the  defect  density  distribution  is  exponential.  A  comparison  of  these  and  two 
other  yield  models  is  shown  in  Figure  S-l.  The  Poisson  model  is  the  most  pessimistic  and  the 
Seeds  model  is  the  most  optimistic.  Hie  S tapper  model  can  be  adjusted  by  the  shape  parameter  to 
cover  the  area  in  between.  The  following  derivation  is  similar  to  tkat  presented  previously  with  the 
exception  that  the  more  general  S  tapper  model  is  used. 


The  cumulative  probability  of  failure  plus  the  yield  must  add  to  1 .  Thus 

Q(t)  =  l-Y  =  M/(l+D0A/S)S 


If  Qjj(t)  is  the  cumulative  failure  probability  of  a  circuit  with  Dn  average  defects  and  An  area, 
and  QjjOi)  is  the  cumulative  failure  probability  of  a  different  circu;t  with  area  An  and  the  same 

average  defects,  then 


If  the  above  is  solved  lor  Qq  (t)  in  terms  of  Qjj  (t)  and  the  two  areas,  we  get 


00(0*1 


M 


Aq  r  i 

l+K  rRsrsrm-1 


la  a  similar  fashion 


a#o  r 


The  above  equation  relates  the  cumulative  failure  probability  of  a  given  device  to  that  of  another 
devise  its  a  fhacdoa  of  area  and  defects. 

To  help  undei’sKtittl  this  relationship  a  number  of  gr  aphs  were  generated.  *  igures  5-2, 5-3,  and 
5-4  show  the  effect  of  the  shape  factors  for  different  AoD^AnDn  ratios.  From  these  graphs  it  is 

dear  that  when  the  area  defect  factor  is  near  unity  (that  is,  die  chips  have  approximately  the  same 

areas  and  defect  distributions),  the  shape  factor  S  has  modest  effects  on  the  relative  failure 
probability.  Wheat  the  area  defect  factor  is  large  (that  is,  the  Qp  ^evsee  has  a  much  larger  area 
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FIGURE  5-1:  YIELD  AS  A  FUNCTION  OF  DEFECTS  i 


CUM  FAILURE  PROBABILITY 
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CUM  FAILURE  PROBABILITY 
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CUM  FAILURE  PROBABILITY 


a 

+ 
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FIGURE  5-4:  CUM  FAILURE  PROBABILITY  FOR  A0D0/AnDn  =  100  AND  S  =  .01,  .1, 1,  AND  2 


and/or  defect  density),  then  the  effect  of  the  shape  factor  is  more  pronounced.  Stappcr  indicated 
that  IBM  had  seen  S  range  from  0.1  to  2.2  depending  upon  the  type  of  defect,  however,  it  takes 
sub£!2T*a!  quantities  of  data  to  establish  an  S  value. 

The  model  presented  herein  assumes  that  the  areas  and  defect  densities  of  the  devices  used  to 
derive  these  factors  (from  the  database)  are  similar  to  those  that  predictions  will  be  performed  on, 
and  therefore  that  the  direct  relationship  between  area/defect  density  and  hazard  rate  is  valid. 

Figure  5-5  demonstrates  the  effect  of  the  area  defect  factor  when  S=l.  This  is  the  Seeds  or 
Price  model.  From  this  graph  it  can  be  seen  that  for  cumulative  failure  probabilities  less  tlian  10%, 
the  cumulative  failure  probability  of  one  device  is  equal  to  the  defect  area  factor  times  the 
cumulative  failure  probability  of  the  other  device.  This  demonstrates  graphically  that  for  the  S=1 
model  the  cumulative  failure  probability  (and  thus,  the  failure  rate)  is  directly  proportional  to  the 
level  of  defects  and  the  chip  area.  Intel  (Reference  27)  has  also  presented  data  to  support  this 
theory. 

Figure  5-6  shows  a  slightly  different  area  defect  factor  effect  when  S-.01  which  is  close  to  the 
Poisson  defect  distribution.  In  this  instance  the  curves  deviate  from  the  asymptote  at  a  slightly 
lower  point  Again  the  cumulative  failure  probability  is  directly  related  to  the  area  defect  factor. 

The  above  analysis  seems  to  indicate  that  for  Ute  first  few  failures,  which  are  die  only  ones  of 
interest  here,  tire  probability  of  failure  is  directly  proportional  to  die  defect  level  and  to  tire  chip 
area.  Hus  analysis  is  based  on  yield  theory  and  lias  been  demonstrated  to  be  consistent  with  the 
Phillips  model  (Reference  2S)  to  be  discussed  later. 

In  the  previous  discussion  a  general  model  was  presented  to  relate  the  cumulative  TDDB 
failures  for  otic  device  ,f>  that  for  another.  The  key  factors  were  device  area  and  number  of  defects. 
This  reliability  model  was  based  on  the  Stopper  yield  model  and  includes  a  curve  fitting  or  shape 
factor  S.  The  following  model  analyzes  how  some  published  data  fits  Uiat  model. 


In  1979  Intel  published  TDDB  data  for  a  number  of  different  sized  capacitors  (Reference  27). 
This  data  showed  the  cumulative  number  of  failures  in  a  given  period  of  time  for  populations  of 
different  sized  capacitors.  Figures  5-7  and  5-8  illustrates  those  data  points  plotted.  The  four  lines 
represent  four  area/defect  ratios  consistent  with  Intel's  data.  The  different  capacitors  they  used 
were  all  from  the  same  set  of  wafers  so  that  the  defect  levels  should  have  been  consistent 
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FIGURE  5-5:  CUM  FAILURE  PROBABILITY  FOR  S  =  1  AND  AD  =  1, 5, 10  AND  100 
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FIGURE  5-7:  CUM  FAILURE  PROBABILITY 
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The  data  for  S=1 ,  representing  the  Seeds  model,  seems  to  show  the  right  trend.  Indeed,  this 
was  the  model  that  Intel  used.  To  try  to  have  the  model  better  fit  the  data,  two  approaches  were 
tried.  First,  curves  for  different  values  of  S  (from  .2  to  10)  were  generated.  Second,  part  1  in  the 
graph  was  changed  from  first  to  middle  to  last  data  point.  The  results  from  that  data  are 
summarized  in  Table  54. 


TABLE  54: 

STAPPER  MODEL  WITH  VARIOUS  S  VALUES 


Device 

(xlO6^2) 

Cum  Fails 
(in  3x10^  sec.) 

Part  1 

Standard 

S  - 1 

Part3 

Standard 

S«1 

Pa«5 

Standard 

S«1 

Part  3 

Standard 

S  =  .3 

Part  3 

Standard 

S  =  4 

.7 

12% 

16% 

16% 

23% 

15% 

1.55 

29% 

24% 

32% 

31% 

34% 

32% 

1.9 

•  36% 

27% 

- 

34% 

- 

- 

2.5 

46% 

32% 

43% 

42% 

42% 

44% 

3.0 

44% 

36% 

4S% 

«* 

43% 

51% 

While  in  genera!  it  can  be  said  that  smaller  values  of  S  tend  to  dose  or  narrow  the  range  of 
she  data  points,  reasonable  values  of  S  do  not  bring  the  lowest  point  down  enough  (Intel's  12% 
point)  without  significantly  raising  the  data  points  for  the  larger  areas.  Because  die  bud  data  is  not 
perfect  (the  largest  capacitor  actually  had  fewer  imbues  than  the  next  largest  one),  it  is  not  likely 
that  any  model  will  fit  all  dm  dam  points  well  Thus,  the  S  « 1  model  seems  like  a  reasonable  one 
to  start  with,  although  them  is  not  enough  data  to  refine  that  parameter  estimate  at  this  time. 

Leo  (lvefereiicc  45)  has  also  derived  a  model  based  upon  the  Poisson  distribution  function  for 
randomly  distributed  defects,  in  this  model  die  probability  that  a  sample  would  contain  at  least  one 
defect  with  "Weakness  factors1'  larger  than  the  value  w  was  given  by; 


P(w)  a  1^'A.iXw) 


where  A  is  the  oxide  area  and  D(w)  is  the  total  area  density  with  "weakness  factor"  larger  than  w. 
It  was  also  empirically  found  that  the  time  to  breakdown  was  related  to  the  weakness  factor  by; 


G 


where  Eox  is  the  electric  field  in  the  gate  oxide  and  G  is  a  factor  specific  to  each  process.  Thus, 

the  above  probability  can  be  seen  to  indicate  the  cumulative  fraction  of  parts  that  would  fail  at  a 
particular  time  for  a  particular  process  under  a  specific  gate  bias.  By  analyzing  TDDB  data  for  six 
different  capacitors  from  2xl0*^cra^  to  5x10^  cm^  and  the  above  equations,  the  authors  derived  an 
expression  for  D(w).  They  were  thus  able  to  predict  the  failure  rate  and  TDDB  curve  for  any  oxide 
area  and  electric  field  for  their  oxides.  This  relationship  is  duplicated  in  Figure  5-9. 

Of  particular  interest  are  the  following: 

(1)  The  relationship  between  reliability  and  stress,  area,  and  defects  may  be  modeled; 

(2)  The  relationship  appears  to  be  consistent  with  the  Poisson  distribution  function,  whereas 
it  was  previously  presented  that  Intel's  data  for  reliability  as  a  function  of  area  and  defects 
was  consistent  with  the  Seeds  model  and  not  die  Poisson  distribution. 

(3)  The  defect  measurement  or  "weakness  factors"  can  be  found  indirectly  from  a  lot  of  data 
but  cannot  be  measured  directly. 

At  the  Wafer  Reliability  Workshop  at  lake  Tahoe,  California  in  October  of  1987,  there  was  a 
consensus  that  two  good  models  for  time  dependent  dielectric  breakdown  existed  and  were 
sup|KX?ed  by  convincing  data.  One  was  die  Berkeley  model  aixl  the  other  model  was  described  in 
a  paper  by  Walters  and  Zcgers-Van  Duyahovcn  of  Phillips  Research  Laboratories  (Reference  28) 
of  which  some  aspects  were  previously  discussed.  This  model  is  described  below. 


Perhaps  the  most  unique  aspect  of  (he  Phillips  model  is  the  use  of  extreme  value  probability 
instead  of  the  conventional  lognormal  statistics.  It  is  argued  that  using  die  lognormal  distribution 
which  is  good  in  the  region  of  the  median  is  inconsistent  with  the  weak  link  nature  of  dielectric 
breakdown  -  that  is,  the  device  will  fail  at  some  defect  which  acts  as  die  weakest  spot 
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OXIDE  AREAS  (u)  .001  cm,  (b)  .01  cm,  (c)  .03  c.ir 
AND  fd) .  I  cm2.  FROM  REFERENCE  1 


Extreme  value  statistics  are  often  used  when  the  values  of  interest  are  the  largest  or  smallest 
in  the  distribution.  Mechanical  fracture  of  materials  or  the  electric  bne.Jcdown  of  transformer  oil 
(Reference  46)  are  typical  weakest  link  processes.  Gate  oxide  breakdown  depends  on  intrinsic 
material  properties  as  well  as  defects.  Indeed,  it  is  the  1%,  or  .01%  failure  point  that  is  of  more 
interest  than  the  50%  point 

It  is  argued  that  the  hazard  rate  at  low  values  is  approximated  by  an  exponential  function. 
This  idea  was  proved  by  Gumbel  f '  sference  47)  for  various  distributions  including  lognormal. 

Thus, 


1 

T^OO 


d  F(x)  ,  , 

T3x  =exPW 


and 

In  (*in(l-F(x)))  =  a  linear  function  of  x 

In  the  above  equations  F(x)  is  the  cumulative  probability  of  failure  .at  x.  x  could  be  the 
breakdown  electric  field  or  the  log  of  time  to  failure  at  a  constant  voltage. 


A  large  distribution  or  amount  of  data  is  required  to  make  the  above  equations  work  properly. 

-\ 

The  author  had  breakdown  data  on  some  12000  capacitors  and  grouped  them  into  30  groups  of 
400  and  made  another  distribution  of  the  lowest  value  in  each  of  the  30  groups.  When  plotted  on 
extreme  value  probability  paper,  the  grouped  distribution  resulted  in  a  line  parallel  to  the  original 
distribution  (Figure  5-10).  Of  interest  was  that  the  shift  between  the  two  curves  was  about  In 
(400)  which  happens  to  be  the  log  of  the  ratio  of  the  effective  areas  of  the  two  groups  of 
capacitors.  Thus,  the  shift  corresponds  to  the  increase  in  probability  of  finding  a  defect  among  the 
group  of  400.  This  shift  is  predicted  by  the  "stability  postulate"  of  extreme  value  statistics  and 
should  hold  as  long  as  the  defects  are  distributed  homogeneously  (i.e.,  Poisson).  When  defects 
are  clustered,  the  shift  is  less. 

Since  the  factor  In  (-ln(l-F))  scales  accordingly  to  the  factor  ln(A2/Aj),  it  follows  that; 

In  (-hi  (1-F2))  -In  (-In  (1-Fj))  =  In  (A2/Aj) 
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FIGURE  5-10: 

>WN  DISTRIBUTION  OF  12000  CAPACITORS  (LOWER  CURVE)  AND  THE 
T  VALUE  OF  EACH  GROUP  OF  400  (UPPER  CURVE). 


where  Fj  and  F2  axe  the  fractions  of  the  parts  that  have  failed  with  gate  oxide  areas  Aj  and  A2 
respectively.  From  the  above  it  is  easy  to  show  that 

F2  =  1  -  (l-F1)A2^Al 

This  type  of  equation,  relating  Fj  and  F2,  was  used  earlier  to  make  graphs  based  upon  the 

Stapper  model.  Similar  graphs  are  shown  in  Figure  5-11  and  5-12  with  shaping  parameter  S  =  1 
and  5  respectively.  It  is  seen  that  Figure  5-12  with  S  -  5  gives  a  remarkably  good  fit 

An  expression  was  also  derived  from  the  extreme  value  statistics  of  the  Phillips  model  to 
indicate  that  the  ratio  of  failure  (i.e.,  hazard)  rates  was  equal  to  the  ratio  of  gate  oxide  areas.  Then' 
data  tended  to  support  this  relationship.  Since  this  is  a  simple  and  easy  to  work  with  relationship, 
other  published  works  were  examined  to  find  data  to  support  or  xeject  this  model.  While  many 
researchers  have  presented  various  types  of  time-to-breakdown  data  for  various  conditions,  few 

have  moved  the  next  step  and  presented  failure  rate  data,  especially  failure  rates  for  devices  with 
different  oxide  areas.  While  failure  rates  can  easily  be  calculated  from  t$Q  and  sigma  data,  the 

frequent  use  of  high  sigmas  (greater  than  1)  in  this  type  of  testing  can  result  in  failure  rates  that  are 
extremely  sensitive  to  modest  changes  in  the  data. 

T  he  researchers  at  Berkeley  have  calculated  and  published  failure  rate  data  for  devices  with 
different  oxide  areas  (Reference  39).  A  representation  of  one  of  die  published  graphs  is  shown  in 
Figure  5-13.  From  this  graph  it  is  easy  to  see  that  for  reasonable  product  lifetimes  (10  years  or 
less),  the  failure  rate  of  a  chip  with  lOx  die  area  of  another  chip,  is  lOx  the  failure  rate  of  that  chip 
(assuming  similar  defect  densities).  After  some  years  of  operation,  the  relationship  would  not  be 
as  good.  Thus,  their  data  and  model  agree  reasonably  well  with  *he  Phillips  model. 

The  Berkeley  failure  rate/area  relationsliip  is  based  on  the  assumption  that  the  defects  are  not 
unifonuly  distributed  across  the  wafer  -  that  is,  a  Gamma  distribution  function  and  the  Stapper 
Yield  model  with  S  » .6.  Extending  the  approach  used  in  that  report  produced  the  graphs  shown 
in  Figures  5-14, 5-15  and  5-16. 

Figure  5-14  demonstrates  the  cumulative  failure  probability  of  a  chip  when  the  cumulative 
failure  probability  of  a  smaller  chip  *20%  of  the  oxide  area)  is  known  and  die  shaping  factor  S  « 
.3,  .6  and  .9.  This  chart  indicates  w  when  die  failure  probability  of  the  smaller  chip  is  small,  the 
failure  probability  of  the  larger  chip  will  be  5x  as  much  (the  ratio  of  the  oxide  areas).  Hie  simple 
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ULATTVE  FAILURE  PROBABILITY  OF  2  DEVICES  WHOSE  AREA  X  DEFECT  DENSITY 
HAVE  THE  RATIO  OF  1, 5,  10,  AND  100  WITH  THE  SHAPING  PARAMETER  S=l. 


2  33IA3Q  -QOdd  3drniUJ  WfD 


c4 

T-H 

I 

tn 


S 


o 


ft 


73 


CUMULATIVE  FAILURE  PROBABILITY  OF  2  DEVICES  WHOSE  AREA  X  DEFECT  DENSITY 
HAVE  THE  RATIO  OF  1,  5, 10,  AND  100  WITH  THE  SHAPING  PARAMETER  S=5. 


THEORETICAL  CURVES  OF  FAILURE  RATE  FOR  DIFFERENT  GATE  OXIDE  AREAS 
AT  5V  OPERATION.  TIME  AXIS  CHANGES  AT  DIFFERENT  VOLTAGES 
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relationship  does  not  hold  as  well  when  the  failure  probability  of  the  smaller  chip  is  more  than 
10%.  Figure  5-15  shows  a  similar  graph  where  the  oxide  area  of  the  larger  chip  is  IQx  that  of 
smaller  one. 

In  a  similar  fashion  Figure  5-16  demonstrates  the  case  where  the  area  ratio  is  50.  These 
charts  suggest  that  if  a  large  number  of  chips  with  more  than  one  oxide  area  are  life  tested,  then  for 
the  first  few  failures  the  number  of  failures  for  chips  with  different  area  will  be  proportional  to  the 
area  of  those  chips  (for  the  TDDB  mechanism). 

Since  the  exact  value  of  the  S  factor  to  use  is  not  clear,  Figure  5-17  was  generated  to  show 
the  effect  of  different  S  values.  It  is  clear  that  for  larger  S  values,  the  area  ratio  model  is  a  better 
approximation.  However,  for  S  =  .01,  approaching  the  Poisson  case  where  S  =  0,  the  model  is 
valid  only  for  smaller  cumulative  failure  probabilities.  Table  5-5  demonstrating  some  of  these 
effects  is  shown  below.  In  this  table,  tire  columns  represent  the  cumulative  failure  probability,  the 
chip  area  ratio,  the  S  value,  and  the  cumulative  failure  probabilities  ratio,  respectively.  Ideally,  the 
2nd  and  4th  columns  would  be  the  same.  It  can  also  be  seen  from  this  data  that  the  area  ratio 
model  is  less  consistent  for  larger  area  ratios. 
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4ULATTVE  TDDB  FAILURE  PROBABILITY  FOR  DEVICE  WHEN  CUMULATTV 
iDB  FAILURE  PROBABILITY  OF  ANOTHER  DEVICE  WITH  1/10  GATE  OXIDE 
AREA  AND  SHAPING  FACTOR  S  =  .3,  .6  AND  .9 


TDDB  FAILURE  PROBABILITY  OF  ANOTHER  DEVICE  WITH  1/50  GATE  OXIDE 
AREA  AND  SHAPING  FACTOR  S  =  .3,  .6  AND  .9 


FIGURE  5-17: 

IIMUIATTVE  TDDB  FAILURE  PROBABILITY  FOR  DEVICE  WHEN  CUMULATIV 
TDDB  FAILURE  PROBABILITY  OF  ANOTHER  DEVICE  WITH  i/10  GATE  OXIDE 
AREA  AND  SHAFTS  FACTOR  S  =  .01,  l,  5 


TABLE  5-5: 

AREA,  S  VALUE,  AND  CUMULATIVE 
%  FAILURES  COMPARISON 


CFPj 

A2/A1 

S 

CFiyCFP! 

1% 

5 

.6 

4.8 

10% 

5 

.6 

3.3 

.1%  . 

10 

.6 

10 

1% 

10 

.6 

9 

10% 

10 

.6 

4.6 

10% 

10 

.01 

1.2 

10% 

10 

2 

5.7 

10% 

10 

5 

6.2 

.1% 

50 

.6 

48 

1% 

50 

.6 

31 

10% 

. . . . . 

50 

.6 

6.5 

lit  summary  the  TDDB  portion  of  the  failure  rate  is  modeled  as  directly  proportional  to  the 
gate  oxide  area.  For  this  model,  a  failure  rate  must  be  established  for  a  "standard"  or  "reference" 
chip  of  known  area.  As  stated  previously,  the  average  area  of  the  die  contained  in  the  database  of 
21  ern^  is  to  be  used  for  this  purpose. 

5. 1.2.3. 1  Calculating  Defect  Densities 


The  previous  discussions  have  analyzed  the  arca/defcet  density/yield  relationships  and 
concluded  that  the  oxide  hazard  talc  is  directly  proportional  to  the  die  area  and  defect  density  (D). 
The  most  desirable  means  to  calculate  the  actual  value  of  defect  density  to  be  used  in  the  hazard  rate 
expression  is  to  use  defect  measuring  test  structures  of  the  actual  fabrication  process.  One  way  to 
calculate  this  defect  density  is  given  in  Appendix  B. 
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It  is  not  likely  that  defect  density  data  can  be  obtained  from  manufacturers  unless  required  by 
a  certification  program  such  as  the  Generic  Qualification  program.  For  this  reason,  it  is  imperative 
that  a  default  condition  be  defined  if  actual  defect  densities  cannot  be  determined.  The  following 
discussion  summarizes  this  default  value. 


Reference  6  discusses  the  following  effective  defect  density  (D^pp): 


^  «  fXof 

deff~d0  YZ 
\ 


where,  Dq  is  a  critical  defect  density  for  a  feature  size  of  Xq 
Xq  is  a  reference  feature  size 
Xs  is  the  actual  feature  size 


This  relationship  is  based  on  MIL-STD-2G9  which  states  that  tire  defect  density  increases  as 
lire  square  of  decreasing  feature  size. 

Although  there  is  soars  concern  as  to  this  assumption's  validity,  and  various  researchers  kave 
proposed  their  own  defect  density  distributions,  for  the  purposes  of  this  modal  die  M1L-STD-209 
approach  is  the  most  widely  recognized  and  will  be  used,  Tiiercfore,  the  default  value  of  D/DR 
2 

will  be  (Xq/Xs)  .  Since  die  average  feature  size  in  tire  database  from  which  the  failure  rate  were 
derived  was  2  micron,  diis  value  can  be  used  for*  Xq.  The  default  defect  density  is  therefore 

(2/Xs)2. 


5.1. 2.3.2  Effective  Areas  and  Device  Type  Relationships 

Since  oxide  area  lias  been  shown  to  be  directly  correlated  with  failure  rate,  devices  exhibiting 
different  packing  densities  will  result  in  significant  differences  in  reliability.  Ideally,  die  actual  gate 
oxide  area  would  be  i  as  an  input  into  the  reliability  prediction  model.  It  is  impractical, 
however,  to  require  that  it  be  known  since  a  relatively  detailed  design  analysis  of  die  chip  must  be 
done  to  detemuite  it  To  alleviate  this  need,  a  relative  sealing  factor  was  derived  for  various 
component  types  which  is  multiplied  by  the  total  die  area,  such  that  the  only  area  input  into  the 
model  is  die  total  die  area.  Since  the  average  die  area  of  devices  whose  data  was  used  to  derive  die 
model  is  .21  cm?,  the  device  type  correction  factors  for  metal  and  oxide  to  be  presented  in  this 
section  are  narcaalizad  to  typical  oxide  areas  and  metal  lengths  of  a  .2  lcrn^  chip. 
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To  derive  a  relative  scaling  factor,  of  particular  interest  was  the  difference  in  physical  features 
(i.e.,  metal  area  and  gate  oxide  area)  for  different  circuit  types  (gate  array,  RAM,  custom  chip). 
For  this  purpose  the  Yield  Analysis  Tool  (YAT)  was  used  to  determine  critical  areas  for  the 
different  chip  types.  The  YAT  is  a  software  tool  that  quantitatively  analyzes  circuit  features  on 
integrated  circuit  devices.  The  YAT  input  is  the  layout  database  of  the  integrated  circuit  device  and 
a  table  which  describes  the  circuit  features  to  be  analyzed.  The  histograms  output  describe  the 
distribution  of  features  sizes,  chip  maps  of  the  smallest  features,  total  area  of  selected  features, 
overlap  edge  length,  and  overlap  area.  This  tool  is  generally  used  for  yield  projection  and,  with  the 
appropriate  wafer  processing,  tire  identification  of  yield  inhibitors. 

For  the  purpose  of  this  analysis  YAT  data  for  four  different  CMOS  chips  was  analyzed  for 
metal  1  length  (less  than  3  microns  wide),  metal  2  length,  p  channel  gate  area,  n  channel  gate  area, 
and  metal  1  space  length  (the  total  length  of  parallel  first  metal  mnner  less  than  3  microns  apart). 
The  results  of  that  analysis  are  summarized  in  Table  5-6. 


TABLE  5-6: 
FEATURE  DENSITY 


Feature 

Units 

■SHI 

SRAM 

jplliSSfCTB 

— 

an/cnr 

390 

619 

289 

698 

mmm 

an/cnr 

291 

317 

400 

668 

IP  gate  area 

%  of  total  chip 

1.3 

1.9 

2.2 

2.5 

N  gate  area 

%  of  total  chip 

1.2 

l.S 

2,6 

2.6 

Metal  1  space 

cni/cnr 

511 

618 

743 

600 

It  should  be  noted  that  the  specific  SRAM  for  which  data  was  available  was  a  2Kx8  device  of 
modest  size  (.24cm^)  that  did  not  efficiently  use  the  pad  area  and  thus  may  have  a  more  modest 
density.  Also  the  gate  array  data  includes  unused  gates. 

To  better  understand  some  of  the  differences  in  Table  5-6,  a  more  detailed  analysis  was 
performed  on  the  metal  1  length  and  the  n  channel  gate  oxide  area.  This  detailed  data  is  shown  in 
Tables  5-7  and  5-8  respectively. 


TABLE  5-7: 

METAL  1  COMPARISON 


Feature 

Units 

1  1 

E3HS 1 

Custom 
Chip  2 

SRAM 

mgm 

Total  Length 

cm 

667 

r  '7 

210* 

914 

Average  Length 

micron 

12.8 

13.7 

4.3 

7.4 

%£1.6  micron  wide 

% 

48 

56 

13 

0 

%£2.6  micron  wide 

% 

53 

67 

29 

74 

Avc.  Length  1.6  micron 

micron 

23 

119 

17 

0 

Ave.  Length  2.6  micron 

micron 

12 

3.7 

11 

30 

*SRAM  is  1/4  size  of  other  chips. 
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TABLE  5-8: 

N-CHANNEL  GATE  OXIDE  COMPARISON 


N  Gate  Oxide 

Units 

1 

1 

1 

SRAM 

■gH 

1 

Instances  <10  microns^ 

Number 

1.5K 

18K 

49K* 

17K 

Instances  50-60  microns^ 

Number 

2.9K 

3.7K 

.9iJ! 

37  K 

Average  area 

Microns^ 

32 

35 

10 

40 

Total  Instances 

Number 

36K 

52K 

54K* 

63K 

Total  Area 

ern^ 

.0114 

.018 

.0053* 

.025 

*SRAM  is  1/4  size  of  other  chips. 

This  analysis  shows  that  custom  chips  may  not  be  as  densely  packaged  as  SRAMs  and  gate 
arrays.  The  gate  array  densities  listed,  would  generally  be  reduced  by  a  factor  that  may  approach 
the  percent  utilization  number.  If  that  were  done  then  all  chips  would  have  comparable  metal  1 
length  and  the  SRAM  would  have  the  densest  gate  oxide  percentage  and  close  metal  1  lines.  The 
analysis  indicates  that  the  density  is  design  dependent 
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A  presentation  at  the  1987  GOMAC  conference  by  engineers  from  United  Technologies 
Microelectronic  Center  (Reference  3)  tried  to  correlate  failure  rate  and  design  complexity.  Using 
life  test  data  from  gate  array  variations,  they  looked  for  correlation  to  traditional  measures  of 
complexity  such  as  number  of  inputs  and  outputs,  die  size,  and  number  of  transistor  pairs,  as  well 
as  new  bench  marks  such  as  metal  1  area,  metal  2  area,  and  coincident  metal  area.  UTMC  saw 
poor  correlation  between  the  number  of  I/O's  to  the  normalized  failure  rate  and  fair  correlation  to 
the  number  of  transistor  pairs.  They  demonstrated  good  correlation  (correlation  coefficients  of  .96 
to  .99)  for  the  failure  rate  versus  metal  1  area,  metal  2  area,  and  coincident  metal  area.  They  point 
out  that  the  life  tests  were  performed  on  early  3  micron  technology  parts  where  intermetal  dielectric 
integrity  was  the  predominant  failure  mechanism.  They  did  not  show  the  results  of  failure  rate  plots 
versus  gate  oxide  area,  number  of  vias,  and  number  of  contacts.  Clearly,  the  correlation  would  be 
dependent  oil  the  type  of  failures  observed.  The  data  seemed  to  indicate  that  a  part  with  much 
metal  1  would  also  have  much  metal  2,  much  coincident  metal,  and  much  gate  oxide  area,  such  that 
a  considerable  amount  of  data  would  be  required  to  properly  sort  out  the  effects. 

Since  the  data  in  the  database  (contained  in  Appendix  D)  is  from  a  good  cross-section  of 
device  types  (logic,  memory,  gate  arrays,  etc.)  the  failure  rate  expressions  developed  are  for  an 
"average"  device.  Therefore,  the  area  ratio  A/Ar  factor  only  has  to  be  a  relative  number  between 
the  various  device  types.  Table  5-9  summarizes  the  relative  area  factors  for  Custom/Logic  devices 
and  for  Memory/Gate  Array  devices.  The  Custom/Logic  average  percent  oxide  area  was  obtained 
by  averaging  the  sum  of  the  P  and  N  gate  areas  (in  Table  5-6)  for  Custom  Chips  1  and  2 
(((1. 3+1.2)  +  (1.9+1.8))/2=*3.1).  Similarly,  the  average  oxide  area  for  Memory/Gate  Arrays  is 
the  average  total  (P  and  N)  percent  areas  for  the  SRAM  and  Gate  Array.  A  similar  procedure  was 
used  to  derive  the  average  metal  length.  For  example,  the  average  metal  length  for  Custom/Logic 
devices  is;  ((390+291)  +  (619+317))/2  =  808.  Tire  Ajypg  factors  were  then  derived  simply  by 

normalizing  the  factor  to  1  for  an  "average"  device.  That  is;  4.95/3.10  =  1.23/.77. 


TABLE  5-9: 

RELATIVE  OXIDE  AREA  AND  METAL  LENGTH 
RATIOS  FOR  VARIOUS  DEVICE  TYPES 


Device 

Type 

Average 

Gate  Oxide  Area 
(%  of  Chip) 

> 

1 

♦Average 

Metal 

Length 

^TYPEmet 

Custom  & 

Logic  Devices 

3.10 

,77 

808 

.88 

Memory 

Gate  Arrays 

4.95 

1.23 

1027 

1.12 

♦Average  linear  length  of  first  and  second  layer  metal  per  chip  area. 

The  area  of  the  chip  can  then  be  modified  with  die  A1yP1^x  for  the  oxide  failure  rate  and  the 
ATYPEmet  ^or  mela*  ^ure  rate’  These  are  dimensionless  factors  which  indicate  for  oxide. 

that  memories/gate  arrays  typically  have  1.23/.77  «  1.59  times  the  oxide  per  area  as  do  custom  and 
gate  arrays. 

5.1.2.5  Oxide  Wearout  t^ 


The  primary  oxide  wearout  stress  related  acceleration  factor  have  been  determined  to  be 
temperature  and  electric  field.  Therefore  the  0$  of  the  iognoraial  can  bo  defined  as: 


*50' 


150ref 


lOX 


ox 


To  calculate  t^^p,  data  in  Table  5-10  was  extracted  from  the  literature. 
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TABLE  5-10: 

DATA  USED  TO  DERIVE  t50  CONSTANT 


Since  electric  fields  of  5  MV/cm  are  being  used  reliably  in  VHSIC/VHSIC-Like  devices,  it 
app&ass  that  this  data  is  not  representative  of  these  class  of  devices.  Therefore,  to  derive  a  *50reF> 

empirical  I5Q  data  at  high  electric  fields  was  used.  This  data,  not  presented  here  for  proprietary 

reasons,  was  from  high  field  accelerated  tests  performed  on  VHSIC-Likc  oxides.  Calculating  tnc 
•Aym  and  Ax0X  terms  under  the  conditions  of,  that  test  and  solving  for  tSOpjpF  yields  a  value  of 

1,3  x  I022  (T  O6  bn,).  This  yields  the  following  t5Q  time  for  TDDB: 


‘50os 


1.3  s  1022 


Empirical  data  front  the  literature  indicates  that  a  sigma  of  1  is  appropriate  for  these 
distributions  for  o&kks  under  nonnal  operating  conditions.  However,  various  investigators  have 
determined  that  the  sigma  is  directly  related  to  the  MTF.  (That  is,  as  the  MTF  goes  down,  so  does 
the  sigma).  Unfortunately,  there  is  not  enough  quantitative  data  available  to  define  an  accurate 
sigma  as  a  function  of  MTF.  Therefore,  unless  an  accurate  sigma  is  known  for  a  particular 
process,  the  value  of  l  should  be  used.  . 

Also,  in  the  Ijq  equation  to  be  used  in  the  model,  a  (QML)  factor  is  added  to  account  for  the 

improved  reliability  expected  from  the  procedure  taken  by  manufacturers  that  re  on  the  Qualified 
Manufacturer  List  (QML).  Hie  rationale  for  chosing  die  .5  to  2  values  for  tins  factor  are  given  in 
Section  7.1.4. 
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This  factor  is  also  used  in  this  form  in  the  150  expressions  for  metal  and  Hot  Carrier  wearout 
failure  rates. 

5.1.3  Oxide  Hazard  Rate  Summary 

It  has  been  shown  that  the  oxide  weaiout  hazard  rate  is  directly  proportioned  to  area  and 
defect  density.  There  is  no  reason  to  believe  that  both  the  early  life  and  wearout  terms  will  not  be 
accelerated  as  a  function  of  area/defect  density.  The  form  of  the  oxide  failure  rate  is  therefore: 

^ox  =  Ag  ^TYPEox  ^Early  Life  +  ^Wearout) 

where: 

=3  .21  cra^,  the  reference  chip  area 
=  1  ,  the  reference  defect  density 

The  entire  detailed  oxide  hazard  rate  is  then  given  as  follows: 


Xox  (in  F/106) 


A  AtypHqx  /Do, 

w 


VR 


$)[<■ 


0788  c 


-7.7 


*°)  (A_.  )(e 

*  ox 


-7.7ATo)tt 


A  =  Total  Grip  Area 

^typEq*  ”  *7?  Custom  and  Logic  Devices 

=  1.23  tor  Men»ries  and  Gate  Arrays 

AR  =  .21  cm^ 


88 


D0ox  =  Defect  Density  calculated  by  using  the  procedure  of  Appendix  B 

fXQS2 

(if  unknown,  use  I  x^j  where  Xq=  2  pm  and  Xs 


Dr 

^0 


At 


is  the  feature  size  of  the  device) 

1  defect/cm^ 

Effective  Screening  time 

(Actual  Time  of  Test  in  10^  lirs.)  *  ATqx  AVqx 
(where  A-r  and  Ay  are  the  values  during  screening) 


ox 


=  Temperature  Acceleration  Factor 


ox 


r  --3  f 

1  1  Y1 

L8.63xlO-5  V 

IJ’SSjJ 

Tj  »  Average  junction  operating  temperature 
*Tc  +  9jCP(in  °K) 

-192  f -  Kpr'] 

s#  e  v^ox 

«=  The  maximum  power  supply  voltage  (VLO)  divided  by  the  gate  oxide 
thickness  (in  MV/an) 


‘ox  vox 


"ox 


QML  *  2  if  on  QML,  .5  if  not 

-  Sigma  obtained  from  test  data  of  oxide  failures  horn  tltc  same  or  similar 

pax  ess.  If  not  available,  use  a  value  of  1 . 


Time  (in  10^  Hours) 
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5.2  METAL  FAILURE  RATE 


5.2.1  Metal  Early  Life-Failures 

As  in  the  case  of  oxide,  the  early  life  failure  rate  due  to  metal  was  derived  using  the 
methodology  outlined  in  Section  3.0.  Table  5-11  summarizes  the  data  used  in  this  derivation. 
Given  in  this  table  is  the  effective  time  interval  (at  25°C),  the  total  accelerated  part  hours  using  the 
temperature  acceleration  factor  with  an  activation  energy  of  .55  eV,  and  the  total  number  of  metal 
failures  observed  in  the  time  interval. 


TABLE  5-11: 
METAL  FAILURE  DATA 


Equivalent  Time 

Interval  at  25  °C 
(106  hrs.) 

Accelerated 

Part  Hours 
(106  hrs.) 

Number  of 

Failures 

0-0.059872 

16468.541808 

3 

.059873  -  0.209553 

40975.894310 

6 

.209554  -  0.623669 

77953.612970 

9 

.623670  -  1.248337 

111  354.649173 

36 

Table  5-12  presents  the  summarized  data  used  in  the  derivation  of  this  factor. 
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TABLE  5-12: 

METAL  EARLY  LIFE  SUMMARIZED  FAILURE  DATA 


Effective 

Range  (10^  hrs.) 

Effective 

Part  Hours  (10^) 

Number  of 

Failures 

Failure 

Rate  F/106 

0  -  .059872 

.059873  -  .623669 

.029936 

.36177 

n 

3 

15 . 

.000182 

.000126 

For  the  data  in  this  table,  the  failure  data  in  the  range  0.623670  -  1.24833?  was  excluded 
since  the  36  failures  observed  were  detected  only  at  a  single  time  in  that  interval  and  the  time  of 
failure  was  not  known.  All  that  was  known  was  that  the  36  failures  occurred  less  than  1,248337. 
This  data  therefore  could  not  be  used  to  identify  failure  race  dependency  on  time. 

Deriving  the  metal  factor  with  a  simple  regression  solution  of  the  above  dam  arid  adjusting  the 
T  to  match  the  observed  cumulative  failure  rate  yields: 

XMET  (t>  *  .00043  e“U8{ 


Adjusting  the  constant  .00043  to  account  for  tire  unknown  Allures,  and  adding  the 
atxcleratiotts  due  to  temperarne  and  screening  yields  the  following  metal  failure  mte: 


'•MET  kTiy  Life 


.00102  e’US  10  ATwo'US  Atmet  t 


5.2.2  MsifllBaEM 

Electroaugration  is  the  predominant  racial  wcarout  failure  mechanism  and  its  probability 

density  function  is  modeled  with  a  lognonual  time  to  failure  distribution.  The  mean  time  to  failure 
(c50>  ^  <^ts0  by  Black's  equation: 


{5G®  AjjJ  *n  exp  ^ 


Ea 


To  detemiins  the  value  of  Ag  that  should  be  used  in  this  expression,  the  dataset  in  Tabic  5- 1 3 
was  used. 


TABLE  5-13: 

SUMMARY  OF  OBSERVED  PARAMETER  VALUES  FROM  THE  LITERATURE 

(all  Lognormal  Distribution) 


Based  on  this  summary  of  information,  representative  values  of  n  and  the  activation  energy 
are  2  and  .55  eV  respectively.  With  this  information  tlie  tjQ  expression  becomes: 


Ab  (METAL  TYPE) 

t50= — P“aT - 

J  atmet 


where: 

Ag  (METAL  TYPE)  *  HBaye"  t^o  as  a  function  of  the  metal  t ype  used 

Using  the  data  in  Table  5-12,  the  constant  Ag  was  derived  for  Aluminum  metallization  and 

turns  out  to  be  .39  million  hours.  For  Al-Cu  tnetal  this  constant  was  calculated  to  be  14.5  million 
hours,  or  37.5  times  the  constant  for  aluminum.  Therefore,  the  (METAL  TYPE)  factor  is: 

(METAL  TYPE)  *  1  for  Alwmaum 
«  37  J  for  Al-Cu 

Towner  m  al,  (Reference  1 2)  concluded  front  empirical  cfectmmignuion  testing  of  aluminum 
conductor  line^-fhat  the  mean  time  to  failure  (&gj)  increases  m  tins  inverse  square  of  the  duty  cycle 

of  pulsed  operation  Hot  is: 

ISO  (constant  DC) 

'  (pulsed  DC)  *•*< - —-ag — * - 

where: 

r®  duty  cycle 

‘  Since  the  etfda&on  developed  was  based  ea  constant  current  conditions,  the  actual  duty 

cycle  should  be  accounted  for.  'Therein  w&e»  cafcatedag  the  current  density  (J)  to  be  used  in  the 
tS%BT  c4luaii0|L  the  average  absolute  value?  should  be  used 

.  Also,  investigated  for  the  etecconugrmon  £50  model  was  the  use  of  a  testing  effectiveness 

factor-  ElccacKiugration  ^ddMtures  .m  wdy  repotted  and  rdectrondgradon  bnm-in  failures  are 
even  i&rer,  although  they  have  begn  reposted  (Reference  57).  It  is  generally  thought  that  if  tire 


screen  or  bum-in  detects  any  electromigration  failures,  then  there  will  be  many  more  field  failures 
soon.  Without  wafer  level  tests,  it  is  difficult  to  conceive  a  screen  that  is  effective  for 
electromigration. 

The  use  of  an  area/defect  density  factor  was  also  investigated  for  metal  wearout  as  it  was  in 
the  oxide  wearout  case.  Modeling  electromigration  is  not  as  straight  forward  as  in  the  oxide  case. 
There  have  been  a  number  of  studies  which  found  the  electromigration  lifetime  related  to  the  line 
length  to  some  critical  length  which  most  likely  is  related  to  line  width  (References  48, 49, 50, 51). 
The  idea  here  is  that  me  failure  of  a  line  is  dependent  upon  the  existence  of  a  worst  case  grain 
boundary  situation,  and  that  a  line  c  ?  a  critical  length  has  a  high  probability  of  having  such  a 
situation  and  therefore  has  a  certain  tendency  to  fail.  A  longer  line's  tendency  to  fail  is  not 
significantly  greater.  This  critical  length  for  near  micron  lines  is  probably  i"  the  order  of  1000 
microns  (Reference  5).  Since  VLSI  devices  have  meters  of  minimum  width  metal  films,  this 
approach  suggests  that  die  electromigration  failure  rate  factor  is  essentially  the  same  for  all  large 
chips  with  a  particular  metallisation  process  using  a  particular  set  of  design  rules. 


For  this  reason,  the  area/defect  density  factor  will  only  accelerate  the  early  life  metal  failure 
rate  and  mi  the  dccsomlgratiors  case.  The  metal  defect  density  to  be  used  can  be  calculated  as 
given  in  Appendix  B. 
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5.2.3 


Tlie  model  form  of  die  metal  failure  rate  is: 


A  ^OmF.T 


ATYPEmeT  ^MET  Early  LifP  +  ^MET  Wearout 


The  complete  detailed  metal  failure  rate  is  therefore  given  as  follows. 


H  r  A  ^OmET  *  /  m\i  nn  .-1.18  tfk  /  a  s  /.-1.18  ATnjcrte 

%ET  =  [  jq  ~D^“  ATYPEMET  (-00102  6  }  (ATMET)  (e  ] 


.  r  .399  f  -.5 

+  . .  -  w^p  I  . . .  - . ! 

(t+tQlCf^ET  [^MET 


(t+tQ)  -  In  ^Omet)2 


A  =  Toted  Chip  Area  (in  cm2) 

ATYPEm_  -  .88  for  Custom  and  Logic  Devices 


1. 12  for  Memory  and  Gate  Arrays 


.21  cm 


}0met 


Defect  Density  calculated  using  the  method  in  Appendix  B. 

(If  unknown  use  where  Xq  =  2  pm  and  X$  is  the  feature 
size  of  the  device) 


1  defect/cm 
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SffiT  =  Temperature  Acceleration  Factor 


'o 


T,  =  W  +  eJCP  (i=°K) 

Effective  Screening  Time  (inlO^hrs.) 


=  (at  Screening  Temp.)  *  (Actual  Screening  Time  (in  lO6  hrs)) 

(to  Calculate  use  Aj^,  based  on  the  junction  temp,  during  screening) 


^Qmet 


(qML)-^4^«£M 

1  V 


IBT 


(in  106  hrs.) 


(QML)  «  2  if  on  QML,  .5  if  not 


Metal  Type  =>1  tor  A1 


37.5  or  Al-Cu 


37.5  for  Al-Si-Cu 


J  **  The  menu  absolute  value  of  Metal  Curran  Density 
(in  10^  Acaps/cm^) 

°MEl  “  ^  w*  on  elcctromisrurion  fcclurw  torn  the  saute 

or  a  similar  process.  If  this  data  is  not  available  use  0^“  1. 

t  *>  time  (in  10^  hrs) 
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5.3  HOT  CARRIER  DEGRADATION 

It  is  well  known  that  hot  carrier  effects  can  degrade  MOS  transistors  with  short  channel 
lengths.  This  degradation  (Reference  68)  can  b?  expressed  as: 

transistor  parametric  shift  =  Arc1” 


Where  t  is  the  stress  time,  and  the  power  factor  of  time  "n”  changes  according  to  hot  carriers 
injection  mechanisms  (therefore  varies  according  to  the  device  structure)  but  generally  falls  in  the 
range  of  .25  to  .75  (Reference  68,  69).  Tiie  constant  term  '*AHC"  represents  the  magnitude  of 

device  degradation  and  is  related  to  ths  drain  voltage  (Vd)  by: 


Afic  ”  P  exP  Hx/Vd) 

Where  "P"  is  some  constant  of  proportionality.  By  plotting  the  log  of  the  transistor 
parametric  shift  versus  the  log  of  time,  the  constant  term  "AHC"  and  the  power  factor  term  "n"  can 

both  be  determined.  By  plotting  the  log  of  A  versus  1/Vd  the  value  of  "a"  (slope)  and  "p"  (y* 
intercept)  can  be  da;  nrnincd  at  different  drain  voltages.  The  first  equation  above  can  then  be 
reanauged 


Witli  known  values  of  "AHCM  and  ii”  the  mne  T  for  a  given  parametric  shift  cun  then  be 
estimated 
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Another  method  of  predicting  device  lifetime  used  by  other  investigators  is  to  monitor  the 
device  substrate  current  (I^b).  Device  lifetime  "xHC"  has  been  shown  to  be  related  to  the  substrate 

current  by  the  relationship: 


%c=c*asub)-m 

Where  C  is  a  constant  dependent  on  parameters  such  as  drain  current,  etc.,  and  the  power 
factor  W  fails  in  the  range  of  2.7  -  3.2  (Reference  68,  69).  Since  the  substrate  current  is 
relatively  easy  to  measure,  it  has  the  potential  of  being  used  as  way  of  monitoring  sensitivity  to  hot 

carrier  effects  on  suitable  test  structures  at  the  wafer  level.  The  degradation  due  to  hot  carriers  is 
also  a  strong  function  of  temperature  and  frequency,  winch  are  related  to  Isu-0.  If  Isu<0  can 

accurately  be  determined,  the  effects  of  temperature  and  frequency  will  inherently  be  accounted  for. 

Previous  analysis  has  concentrated  on  measuring  the  shift  in  the  threshold  voltage  when  the 
transistor  is  in  the  linear  or  ohmic  region  (VDI)  =  lOOmV),  or  a  shift  in  the  maximum  drain  current 

(Vos,  =  5v)  with  the  source  and  drain  connections  reversed.  In  either  case  the  transistors 
must  be  biased  at  maximum  avalanche  hot  electron  (AHE)  stress  condition  (typically  VGS  =  3.5v, 
VDS  «  6.4v)  for  a  minimum  of  8  hours  to  predict  the  transistor's  sensitivity  to  hot  carrier 

degradation.  A  recent  study  found  that  the  above  equation  holds  not  only  in  static  stress  but  also  in 
dynamic  stress.  Where  the  device  lifetime  can  be  written  by: 

X  dynamic  *»  C  (I^g,  (Reference  71) 

Where  lgtib»  peak  p&k  va*ue  oi  substrate  current  and  R  is  duty  ratio  for  the 
substrate  current  pulse. 

Also  investigated  was  the  possibility  of  ash  g  measured  at  the  wafer  level  on  a  process 

monitor  test  structure  as  a  means  of  predicting  devise  lifetime  due  to  a  hot  carrier  degradation.  A 
hot  carrier  test  found  the  low  dose  drain  (LDD)  n-channel  transistor  was  loss  sensitive  to  hot 

carrier  effects  when  compared  to  a  single  drain  <SD)  transistor  structure.  The  peak  substrate 
current  at  »  5.5  volts  was  measured  at  boom  tempetuture  on  20  LDD  and  20  SD  10/1.0  n- 

channel  transistors  before  the  hot  carrier  test  The  results  were: 
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1 

^sub 

Mean 

StdDev. 

UDD 

3.54 

0.19 

SD 

9.97 

0.52 

Thus  there  is  a  good  possibility  that  substrate  current  meas’rrements  at  the  wafer  level  may 
provide  a  fast  and  reliable  method  to  model  hot  carrier  degradation. 

A  hot  carrier  degradation  model  of  this  type  does  not  address  a  particular  failure  mode,  and 
docs  have  different  impacts  on  memory  chips  and  logic  devices.  This  hot  carrier  model  is  v  "ended 
to  indicate  how  a  transistor's  parameters  degraded.  This  model  form  is  generally  accepted, 
however,  what  is  difficult  is  to  judge  is  how  that  transistor  degradation  affects  circuit  performance 
for  a  memory,  digital  or  analog  applica,  on.  While  a  manufacturer  could  model  this,  the  results  are 
not  likely  to  be  shared.  The  specific  application  of  a  transistor  is  most  important.  For  example  a 
pass  transistor,  where  die  current  may  go  backwards  from  the  drain  to  source,  will  be  affected 
more  tei  an  on/off  transistor  in  an  inverter  gate. 

It  is  the  judgement  of  the  antes  at  this  time  that  either  a  product  has  a  hot  electron  problem 
or  it  does  not  Thus,  the  model  should  be  very  sensitive  to  modest  changes  -  i.e„  the  failure  rate 
may  move  into  the  FIT  range  very  quickly  if  some  threshold  is  exceeded.  Otherwise,  the  failure 
rate  is  in  the  fractional  FIT  range. 

The  possibility  of  using  substrate  current  measurements  at  the  wafer  level  to  provide  a  fast 
and  reliable  method  to  model  hot  carrier  degradation  is  intuitively  appealing.  Other  researchers 
have  also  written  of  this  goal  and  the  discussion  below  involves  some  measurements  that  have 
been  taken  to  investigate  this  matter. 
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The  data  in  Table  5-14  shows  Isuj,,  @  Isu^,  and  the  gate  voltage,  all  with  VDD  =  5.5 

volts  at  3  temperatures.  All  six  10/1  transistors  experienced  the  hot  carrier  test  at  5.5  volts.  Note 
that  the  Vg  values  at  which  Isujj  occurs  changes,  indicating  the  hot  carrier  damage  affects  the 

maximum  electric  field  in  the  channel.  The  300/1.2  transistors  were  not  stressed.  For  both 
transistors  sites,  the  Isui>/Idd  °f  the  single  drain  (SD)  device  was  approximately  2x  higher  at 

all  temperatures.  From  the  data  we  should  be  able  to  (1)  correlate  projected  life  times  to  room 
temperature  Isuj,  data,  and  (2)  determine  the  temperature  dependence  of  1^5. 


As  previously  shown  tbs  device  lifetime  Mt"  due  to  hot  carrier  degradation  has  been  modeled 
as: 


t=c*Wm 

Where  C  is  a  constant  dependent  mainly  on  the  transistor's  response  to  drain  bias  conditions, 
Isujj  is  the  substrate  current,  and  die  power  factor  term  "m"  is  equal  to  where  4qt  is  the 

critical  energy  to  create  an  interface  trap,  and  ^4  is  the  impact  ionisation  energy.  However,  this 

model  does  not  account  for  the  dependence  of  substrate  current  generation  on  drain  current,  which 
can  he  modeled  as  (Reference  69): 


Where  C  is  a  constant,  is  die  drain  current,  X  is  the  mean  free  path  for  elections,  and  E,tl  is 

the  maximum  electric  field  in  the  channel.  Therefore,  a  better  expression  for  modeling  device 
lifetime  would  be  (Reference  72): 
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TABLE-5- 14: 

lyjB  VS.  TEMPERATURE 


RiOOH  rSM/SRATuaS  SATA 


-3  C  DATA 


C  DATA 


U» 


U-2-3  }  3S0/I.3 
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i 
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! 1 . 016-02  i 
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s 
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1 
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1 
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1 
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3 
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In  1987  a  hot  carrier  test  was  performed  at  Honeywell  on  discrete  10/1.0  and  10/1.2  um  n- 

channel  transistors  with  a  Lj-,d  drain  structure,  and  250$.  gate  oxide.  The  transistors  were  biased 
for  maximum  avalanche  hot  electron  current  with  VD  =  6.0  and  7.5  volts  at  77°K.  Due  to 
processing  differences  between  test  structures  from  3  wafers,  there  were  substantial  differences  in 
substrate  current  for  same  size  transistors. 

As  mentioned  previously,  there  is  disagreement  as  to  what  should  be  the  failure  criteria  for  a 

discrete  transistor.  For  the  purpose  of  this  test  the  lifetime  was  defined  as  a  10%  decrease  in  die 
integral  of  as  VSD  is  swept  from  0  to  5.5  volts  with  VGS  =  5.5  volts.  This  criteria  (which  we 

call  "area  under  the  curve")  was  chosen  since  it  monitors  the  transistor's  response  in  both  the  linear 
and  saturation  regions  of  operation. 

Figure  5-18  is  a  log  normal  plot  of  the  cumulative  failures  versus  time  for  19  parts  from  the 
test.  From  tins  plot  it  is  apparent  the  hot  carrier  failure  mechanism  follows  a  lognormal 
distribution,  and  the  presence  of  a  "sport"  population  is  evident  The  sigma  for  this  test  is 
approximately  1.1.  Figure  5-19  is  a  plot  oj.  log  of  x  times  die  drein  current  versus  the  log  of 
ratio  of  divided  by  1^.  The  power  factor  **m"  is  equal  to  2.5,  which  is  in  good  agreement  widi 

published  values.  The  following  listing  summarizes  published  values  for  this  exponent; 


Author 

Affiliation 

Year 

BBSS® 

Reference 

Takedactal 

Hitachi 

1983 

3.2  -  3.4 

68 

Huetal 

CalBcrfcdy 

1985 

2,9 

69 

Tzotmal 

AMD 

W85 

2.7 

73 

Horiuchi  ct  al 

NEC 

1986 

2.5 

71 

Weber 

Siemens 

1986 

2.9 

8^ 

Kriegeretal 

VLSI 

1988 

2.9 

84 

T ten 

AT&T 

1987 

2.9  -  3.2 

85 

;  Qteaetal 

AT&T 

1987 

3.1 

86 

Duvmryetal 

T.S. 

1987 

2.8 

87 

ffdiensetal 

IMEC 

1988 

2.7 

88 

Weber 

Siemans 

198S 

2.9 

89 
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FIGURE  5-18: 

LOGNORMAL  PLOT  OF  FAILURES  DUE  TO  HOT 
CARRIER  DEGRADATION  VS.  TIME 


•c  HOT  CRRRIER  TEST 
■'  i  B>:  CHfiHO r  IN  flfui 
LRSUS  Zsub/Idd  _ i 


-\  Is  is  important  to  note  that  the  model  should  account  for  temperature  effects.  One  research 
effort  (Reference  73)  found  that  the  constant  term  A  in  the  previous  equation  changes  with 
temperature  according  to  the  equation: 

Arc  =  Ao  exP  (--039  eV/kT) 

Where  k  is  Planck's  constant  The  drain  current  1^  and  the  substrate  current  Isu{>  also  change 
at  different  rates  as  a  functior  T  temperature;  in  general,  our  data  indicates  Isuj,  increases  more  at 
lower  temperatures  than  1^.  Using  this  methodology  to  predict  lifetime,  the  data  summarized 

below  indicates  the  impact  temperature  has  on  device  LT ;time  due  to  hot  carrier  degradation. 


Parameter 

77°K 

298°K 

constant  term  "AHC ' 

4xl0*& 

drain  current 

3.11mA 

substrate  current  ISU5 

3.44pA 

0.75pA 

power  factor  term  V 

2.5 

2.5* 

lifetime 

3.B  hours 

*  a  assumed  car  calculated  values. 

Based  on  this  data,  a  150  for  the  lognormal  relationship  can  be  defined  as  follows: 


’50  “a 


C  AubV2-3 


thc 


Td|Tdj 


Where  C  is  a  constant  derived  to  match  empirical  data.  Deriving  C  from  die  above  data  yields 
a  value  of  3.74x10*^, 


i 

| 
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If  actual  values  of  Id  and  are  not  known,  there  must  be  provisions  in  the  model  for 

default  values.  Based  on  the  measured  values  presented  previously,  die  following  default  currents 
can  be  used  for  10/1.2  (widtk/leugth  in  microns)  N-channel  transistors: 

Id  =  33  c’*0069  Tj  (mA) 


Isalj « .0058  e 


-.00157 


Tj  (mA) 


Hot  carrier  degradation  is  strictly  a  wearout  mechanism  and  therefore  should  contain  no 
coAUibatkai  to  the  early  life  failure  rare.  Also,  since  this  phenomena  effects  all  transistors,  it  is  not 
area  or  defect  density  dependent  Therefore,  the  final  failure  ime  contribution  from  hot  carriers  is 
the  following; 


=*  Drain  Current  at  Operating  Temperature.  If  unknown  use 
*$ub  “  *  '-C0157  Tj 

Current  at  Operaiiag  Temperature,  tf  unknown  use 


tj^  .  w  ^*Thc  &  Temp.)  *  (Test  Duration) 

t  »  time  (in  10^  las.) 
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5.4  CONTAMINATION 


Aldiough  the  physics  of  failure  characteristics  of  contamination  and  methods  of  control  are 
fairly  well  understood,  many  failures  due  to  contamination  appeared  in  the  data  collected  for  this 
program.  Due  to  this  observance,  a  failure  rate  for  contamination  was  derived  and  included  in  the 
model.  Here,  the  contamination  failure  class  is  defined  as  any  type  of  contamination,  ionic  or 
other. 


The  prevalence  of  contamination  related  failures  tend  to  be  highly  dependent  on  the  particular 
fabrication  processes  and  thus  the  failure  rare  model  contained  herein  is  an  industry  wide 
representative  value,  ard  can  vary  widely  from  manufacturer  to  manufacturer.  Contamination  is 
strictly  an  early  life  mechanism  and  contains  no  wearout  contribution.  Like  hot  carriers, 
contamination  failures  are  not  area  or  defect  tensity  related  and  therefore  do  not  have  these  terms  in 
the  failure  rare  equation.  They  are  typically  easily  screened  due  to  their  high  activation  energy  and 
temperature  acccteration. 

Table  5-15  stummriaas  the  data  used  in  the  derivation  of  tire  contamination  failure  rate,  la 
this  table  the  firm  column  it  the  equivalent  titae  interval  at  25°C  obtained  by  multiplying  the  actual 
tires  by  the  acceleration  due  m  temperatm^,  tmng  sit  activation  energy  of  1.0  eV,  A  value  of  1.0 
dV  L  vkfely  accepted  in  the  sas^xtnduaosr  industry.  “Tits  second  column  Is  tire  equivalent  number 
«C  total  part  hours  in  the  interval  using  tire  same  acceleration  due  to  rempmuture,  and  the  last 
column  lists  die  number  of  failures  in  duu  time  interval 


TABLE  545: 

CONTAMINATION  DATA 


Equivalent!  tore 

Interval  at  25°C  (in  10^  his.) 

Number 

of  Failures 

1  ■■■■■■nBMIdtxfxiuflll 

muwmmmiwm 

1 

20.431620  -  71.510666 

14006588.472287 

1 

71,510667-  212.829365 

2708 8 162.036 146 

> 

212.829366-  228.845835 

3064959.151818 

2 

228.845836  -  425.658727 

35348454.984858 

595 

425.658728  -  2724.355183 

20233992.578 138 

1 

2724.255184  -  6994.508872 

6635426.324593 

1 

Table  5-16  presents  the  summarized  data  used  The  first  five  data  points  from  Table  5  15 
were  combined  for  the  first  entry  in  Table  5-16  and  the  sixth  and  seventh  were  combined  for  the 
second  in  Table  5-16.  Regressing  on  these  two  datapoints  yields  the  following  failure  rate: 

^gn  *  9,36  x  10*6  C',0G28 1 
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TABLE  5-16: 

SUMMARIZED  CONTAMINATION  DATA 


Failure 

Tune  Interval 

Number 

Rate 

(106  to.) 

■KSISHi55Bi 

Failures 

0-  425.658727 

212.0 

85,128,066 

601 

7.1xl0*6 

425.658728  -  6994.508872 

3709 

26,869,418 

2 

7.4X10"8 

Accounting  for  the  unknown  failures  and  accelerations  due  to  temperature,  duty  cycle  and 
screening  dan©,  the  data  in  Table  5-16  yields  the  following  equation: 


k:oM  * 


-.0028 At  ... 

C  CON 


t 


*TCW  “  Ac«fcmto  taor 


iln'K) 

t0  w  Effective  Scttsenkg  Tusas 

«  (at  screwing  junction  tesaptastme)  *.(ciiuiii  screening 

litt»  lit  #  to.) 

t  »  time  (tn  10^  to.) 
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5.5  PACKAGE  RELATED  FAILURE  RATE 


To  develop  a  model  for  package  related  failures  (Le.,  package,  lead,  die  bond,  interconnects, 
etc.)  the  assumption  was  made  that  environmental  stresses  (i.e.,  temperature  cycling,  thermal 
shock,  vibration,  mechanical  shock)  accelerate  package  related  failure  mechanisms  to  a  much 
higher  degree  than  die  related  failure  mechanisms.  Since  these  environmental  stresses  are  event 
related,  they  can  be  modeled  with  an  exponential  time  to  failure  distribution  and  therefore  a 
constant  failure  rate  can  be  derived  The  data  presented  previously  in  the  database  profile  supports 
this  approach. 

5.5.1  Screening  Factor 

Since  there  is  no  field  experience  data  available,  the  approach  used  to  develop  field  failure 
rates  for  VHSIC/VKSIC-Like  package  sty.^s  was  to  derive  a  tailure  rate  for  lower  complexity 
package  types  for  which  field  experience  is  available  and  multiply  that  failure  rate  by  the  ratio  of 
fallout  rates  observed  between  VHSIC  package  types  and  MSI/LSI  package  types  for  which  field 
experience  is  available. 

Many  package  defects  can  be  screened  effectively  and  therefore  a  strong  relationship  between 
reliability  md  qu  y  should  be  expected.  The  approach  used  in  the  package  failure  rate 
development  was  therefore  to  make  this  failure  rate  a  function  of  quality  (screening  level)  and 
environment.  The  premise  of  this  factor  is  thtt  the  nost  screened  percent  defective  in  a  given 
population  is  equal  to  the  initial  percent  defective  minus  the  percent  fallout  after  the  population  is 
exposed  to  a  screen  (or  screening  sequence),  and  the  field  failure  rate  expected  from  a  particular 
package  style  is  directly  proportional  to  the  fallout  rate  observed  for  that  package  when  exposed  to 
environmental  testing. 

The  following  screening  level  categories  and  associated  observed  fallout  percentage  were 
used  to  derive  the  package  factor  (Reference  76). 


None 

0% 

Burn-in 

.36% 

Environmental  Series 

1.37% 

Bum-in/Environraental  Series 

1.66% 
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These  arc  general  categories  and  are  representative  of  data  from  a  variety  of  sources  and  test 
conditions.  They  are  typically,  but  not  limited  to  MIL-STD-883  tests.  In  this  context  Bum-in  is 
normally  a  high  temperature  (125°C),  short  duration  (160  hours)  test  designed  to  identify  defective 
parts.  Environmental  tests  are  intended  to  be  representative  of  various  environmental  screens  such 
as;  temperature  cycling,  shock/vibration,  humidity,  etc.  As  an  example  Quality  Level  B  devices  are 
subjected  to  both  burn  in  and  environmental  tests. 

These  broad  categories  were  chosen  because  fallout  rate  data  for  specific  screens  loses  its 
statistical  sigxsdfieanes.  Additionally,  many  tests  are  performed  in  a  series,  making  it  impossible  to 
identify  the  fallout  rare  of  each  one. 

Assuming  that  the  test  effectiveness  for  package  related  mechanisms  is  directly  proportional 
to  the  fallout  rate  percentage  for  a  particular  test  or  series  of  tests,  and  that  the  test  effectiveness  for 
a  burn-in  with  environmental  series  is  in  the  SQ-iQC  -ft  range  (or  approximately  90%),  the  test 
effectiveness  along  with  a  relative  failure  rate  correction  factor  for  package  screening,  are 
summarized  in  Table  5-17.  This  yields  a  10:1  ratio  in  the  expected  package  failure  rate  between 
Class  B  and  D  devices. 


TABLE  5-17:  ' 

PACKAGE  TEST  EFFECTIVENESS  AND  SCREENING  FACTOR 


Screening  Level 

Class 

Test  Effectiveness 

nSP 

None 

D 

0 

10 

Bum-in 

* 

20 

s 

Enviramaental  Series 

to 

72 

2.8 

Burn-in  with  Environmental  Series 

B 

90 

1 

. .  .  . . 

Once  the  test  effectiveness  has  been  derived,  the  package  failure  rate  can  be  defined  as 
follows: 

^package  *  nSP  nE 


no 


where: 


XBP  =  base  package  failure  fate 
FQj^  =  reference  fallout  rate  for  MSI/LSI 

FO  =  observed  fallout  rate  for  VHSIC/VHSIC  like  packages 
%  =  environmental  factor 

nSP  =  package  screening  factor 

The  10:1  ratio  of  package  failure  rates  is  applicable  for  the  environments  where  it  would  be 
expected  that  most  of  die  defective  parts  would  be  accelerated  to  failure.  In  a  benign  environment, 
it  would  not  be  expected  that  defective  parts  would  be  accelerated  to  failure  and  therefore  there 
would  be  less  benefit  to  screening  parts  used  in  benign  environments.  However,  without  data  to 

the  contrary,  and  since  the  package  failure  rate  is  normalized  to  unscreened  devices  used  in  benign 
environments,  a  worst  case  failure  rate  ratio  between  quality  levels  forGg  is  also  10:1,  and  treated 

as  an  independent  factor  from  environment. 

h.J.2  MssaomEm 

XBP  can  now  be  derived  from  empirical  field  failure  rate  data  and  failure  mode  distributions 
(percent  of  failures  due  to  tire  package).  To  accomplish  this.  Reliability  Analysis  Center  data 
(Reference  75)  on  MSI/LSI  devices  was  used  to  calculate  an  average  IC  failure  rates  for  these 
combinations  of  screen  class  and  environment  These  failure  rates  are  suramamed  in  Table  5-18. 

TABLE  5-18: 

AVERAGE  X  AS  FUNCTION  OF  ENVIRONMENT 


Screen  Class 

Enviro  nent 

Failure  Rate 

Buro-m/En  vu  onnrent  (B) 

aif 

.285 

Burn-in/Environment  (B) 

Auf 

.441 

Worn*.  (D) 

Gb 

.372 

These  environments  and  screen  classes  were  chosen  since  there  existed  good  quality  data  in 
these  categories  and  also  to  provide  a  good  mix  of  vibration  and  temperature  cycling  stresses.  The 
exact  stresses  in  each  environment  can  not  be  specifically  identified  and  therefore  the  environments 
must  be  qualitatively  defined  as  in  the  current  MIL-HDBK-217  models. 

Next,  failure  mode  distributions  were  used  to  identify  the  percentage  of  failures  due  to 
package  related  failure  mechanisms.  For  this  the  following  data  extracted  from  Reference  75  was 
used; 


iPlte 


IMmk 

I&ata3£ti£ 

Digital 

60.9 

47.2 

Linear  &  interface 

19.9 

23.7 

Memory 

5.3 

20,3 

VLSI 

7.1 

14.3 

From  this,  it  was  determined  that  25%  of  IC  failures  are  typically  due  to  the  packages,  indicating 
that  the  base  failure  rate  would  be  25%  of  the  above  listed  numbers.  These  package  failure  rates 
are  given  in  Table  549. 


Using  the  package  failure  rates  in  Table  549  in  conjunction  with  the  quality  factors  (Rgp) 
given  in  Tabte  $47,  the  relative  envaotanental  factors  for  Ag*.  A^p,  and  GB  environments  can  be 
defined.  This  was  accomplished  by  making  die  product  of  HSP  and  Og  proportional  to  the 

ptekage  faUurerate,  These  relative  environment  factors  mu  presented  hi  Table  5-20,  normalizing 
Gfi  to  one. 
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TABLE  5-19: 

PACKAGE  FAILURE  RATE  AS  FUNCTION  OF 
ENVIRONMENT  AND  SCREEN  CLASS 


Environment 

■gsroffliraaiEM 

Ap 

i 

B 

.071 

Auf 

B 

.110 

D 

.093 

These  ratios  hetwefi  GB,  Ajp,  and  A^p  environments  factors  are  relatively  consistent  with  the 
environmental  factors  currently  in  MIL-HDBK-217E.  Therefore,  assuming  the  relative  nE's 

be  tween  environments  in  217E  ate  valid,  the  values  for  all  environments  are  defined  in  Section 
3.5.8. 

TABLE  5-20: 

ENVIRONMENT  FACTOR 


5.5.3  Package  Type  Factor  (Tlpf) 


The  values  in  Table  5-20  correspond  to  the  lower  complexity  devices  from  which  they  were 
derived.  They  now  can  be  modified  to  the  higher  complexity  VHSIC/VHSIC-Like  device 
packages  via  the  fallout  rate  ratio 

For  a  typical  MIL-STD-883  Class  B  screening  sequence  for  a  DIP  package,  RAC  data 
(Reference  76)  indicates  a  typical  fallout  percent  of  1.66%  of  which  25%  of  these  failures  are  due 
to  package  related  mechanisms.  The  reference  fallout  rate  can  therefore  be  defined  as  follows: 

FOr  (package)  *  (1.66%)  (.25)  =  .415% 

To  modify  these  failure  rates  to  VLSI/VHSIC,  the  package  fallout  rates  in  Table  5-21  were 
derived  from  the  VHSIC/VHSIC-like  database  (Appendix  D)  for  devices  subjected  to 
environmental  testing  consistent  with  a  Class  B  screen.  For  example,  .79  was  derived  by 
observing  the  percentage  of  nonhermedc  chip  earners  that  failed  from  a  combination  of  burn-in  and 
environmental  testing.  The  column  labeled  "total"  is  a  weighted  average  and  was  derived  by 
observing  die  total  fallout  rate  (i.e.,  devices  failed/devices  tested)  for  both  hermetic  and  plastic 
parts  for  each  package  type.  Similarly,  the  row  labeled  "total"  was  obtained  by  calculating  the  total 
fallout  rate  for  both  hermetic  and  noohemietk;  packages. 

Hie  fact  that  the  FOr  listed  above  is  very  close  to  the  total  fallout  rates  in  Table  5-21, 

although  coincidentai,  indicates  that  there  is  good  agreement  between  these  two  separate  methods 
cf  obtaining  fallout  rate,  and  lends  a  degree  of  confidence  in  these  results. 
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TABLE  5-21: 

OBSERVED  VHSICyVHSIC-LIKE  PACKAGE  FALLOUT  RATE 


Nonhermetic 

Hermetic 

Total 

Chip  Carriers 

.79% 

1.4% 

1.03% 

Pin  Grid  Array 

.69% 

.33% 

.49% 

DIP 

.18% 

.30% 

.22% 

Total 

.41% 

.61 

Since  there  is  not  a  significant  difference  in  the  hermetic  vs.  nonhermetic  fallout  rates,  there 
will  be  no  distinction  between  these  for  the  base  failure  rates  although  a  failure  rate  term  will  be 
added  for  the  nonhermetic  effects  of  plastic  packages.  Using  the  totals  for  the  three  packages  types 

for  FOv  the  ratios  can  be  defined  as  in  Table  5-22,  along  with  the  package  type  factor  (normalizing 
DIFs  to  a  value  of  1),  For  example,  the  FO/FOr  factor  for  DIP'S  is  .22/.415  » .53. 


TABLE  5-22: 

PACKAGE  TYPE  FACTOR 


TYpa 

FO/FOr 

nPT 

Chip  Carriers 

2.48 

4.67 

Pin  Grid  Array 

1.18 

2.23 

DIP 

33 

1 

5.5.4  tes 

The  package  failure  rate  can  tiKUxforc  be  summarized  as: 


^PAC  53  ^BP  %  nSp  Hpr 
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nSP 

Ilpr 


El. 


jx  Factor 


Package  Screening  Factor 
Package  Type  Facta: 


nE 


gb 

1.0 

Ajp 

8.15 

auf 

12.5 

nPT 


DIP 

1 

PGA 

2.23 

Chip  Carrier 

4.68 

%> 


No  Screening 

10 

Bum-In 

8 

Env,  Series 

2.8 

Bum-In/Env.  Series 

1 

Since  the  relative  values  of  the  environmental  factor,  screening  factor,  and  package  type 
facto-  have  been  previously  derived,  die  base  package  failure  rate  was  derived  by  setting  the 
observed  package  failure  rate  equal  to  the  predicted  for  a  known  case  and  solving  for  XBP.  Since 
there  is  high  confidence  is  the  Ajp,  Class  B  observed  failure  rates  for  DIPs  were  the  ones  used. 

This  failure  rate  is  .285,  of  which  25%  is  due  to  die  package,  yielding  an  observed  failure  rate  of 
.071  P/106  hrs.  Since  the  FG/FOr  factor  for  DIPs  is  .53  (Table  5-22),  and  die  factor  was 

normalized  to  1  for  D IPs,  die  failure  rate  (XPAC)  for  Dip's  is  (.071)  (.53) » .037  (F/106).  The 

value  of  ,53  is  due  to  increased  reliability  of  DIP  packages  from  the  time  the  observed  field  data 
was  collected  to  tbotkac  the  fallout  rate  dm  collected  for  duseffort  was  obtained  Therefore, 

.037  «S  XgidlgllgplIpY 

,  .037  .037 

0046 


It  has  been  shown  in  past  studies  that  the  package  failure  rate  is  also  a  function  of  package 
complexity,  of  which  number  of  pins  is  one  measure.  From  Reference  9  the  following 
relationships  were  obtained  for  the  package  failure  rate  of  DIFs  up  to  64  pins. 

^package  =  1%  nq  [.00044  +  .00042  (#  pins)]  Hermetic  DIPs 

^package 53  1%  IIq  [.0035  +  .00009  (#  pins)]  Nonhermetic  DIPs 

The  data  used  to  derive  these  numbers  were  predominantly  from  commercial  quality  devices 
in  a  ground  benign  application  with  nE  -  .38  and  rig  =  17.5  (hermetic)  and  IIq  =  35  (for 

nonhermetic). 

Therefore,  using  these  relationships  to  derive  the  XpflC]cage  yields; 

XpAC  (Hermetic) » .00293  +  .00279  (#  Pins) 

XpAC  (Nonhermetic)  *  .0465  4-  .00120  (#  Pins) 

If  these  relationships  are  extrapolated  to  the  pin  counts  of  VHSIC  type  devices,  the  package 
failure  rate  and  the  number  of  pins  .would  be  directly  proportional.  That  is,  a  200  pin  package 
would  have  twice  the  failure  rate  as  a  100  pin  package,  this  is  not  intuitively  appealing  since  there 
are  many  package  related  failure  mechanisms  that  are  not  complexity  dependent.  Failure 
mechanisms  relating  to  tire  lead,  lead/ssal  interface,  and  wire  bonds  can  be  considered  complexity 
dependent  since  they  are  essentially  independent  of  each  other  in  a  reliability  senco.  Conversely, 
tire  die  bond  is  considered  complexity  independent  since  there  is  only  otic,  which  is  independent  of 
the  lead/wire  bond  assembly.  Data  from  Reference  75,  which  provides  failure  mode  data  on  a 
wide  variety  of  part  tyjres,  indicates  that  this  assumption  is  a  reasonable  approximation  for  a 
general  use  reliability  model  To  alleviate  this  situation  in  lieu  of  enough  empirical  data  to  precise^ 
define  this  relationship,  the  assumption  was  made  that  tire  complexity  dependent  and  complexity 
independent  package  failure  rates  are  equal  at  the  average  complexity  value  in  the  VHSIC  database 
(120  pins).  Therefore,  this  yields  die  following  base  package  failure  rate  to  be  used  in  this  model 
is: 


Xjjp  « .0024  4  1.85  x  I0*5  (#  Pins) 
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5.5.5  Junction  Temperature  Calculation 


In  calculating  the  junction  temperature,  the  average  junction  temperature  should  be  used.  The 
standard  method  of  accomplishing  this  is  adding  the  case  temperature  to  die  temperature  rise  due  to 
power  dissipation; 

Tj  =  Tc+  0JC  P 

where:  '  -  Case  Temperature 

0j  c  =»  Junction-CaseThermal  Resistance 
P  Worst  Case  Actual  Power 

To  calculate  Tj,  the  actual  6JC  of  die  device  being  predicted  should  be  used.  If  it  is  not 
known,  however,  the  Mowing  alternate  method  using  6JA  can  be  used; 

Tj“TA+eup 

where  0jA  is  the  junction  to  ambient  thermal  resistance..  Table  5-23  summarizes  0JA's  values 
wliich  were  derived  by  various  manufacturers  and  represent  typical  values.  (Note:  Them  arc  very 

wide  variations  in  these  values). 

* 


ns 


TABLE  5-23: 

JUNCTION  -  AMBIENT  THERMAL  RESISTANCE  VALUES 


eJA  (°c/w) 


#  PINS 

PLCC 

CLCC 

PPGA 

CPGA 

CDIP 

PDIP 

24 

92 

42 

50 

50 

28 

65 

40 

- 

- 

50 

50 

40 

65 

40 

- 

- 

43 

43 

44 

61 

40 

- 

- 

42 

42 

48 

58 

38 

- 

- 

38 

38 

52 

56 

38 

- 

- 

38 

38 

64 

50 

37 

- 

- 

30 

30 

68 

46 

36 

90 

36 

- 

84 

44 

35 

81 

34 

- 

120 

39 

34 

81 

34 

- 

124 

39 

34 

75 

31 

- 

144 

37 

33' 

68 

30 

180 

36 

31 

57 

29 

200 

36 

30 

50 

29 

200 

36 

30 

48 

29 

5.5,6 


Although  there  was  not  a  significant  difference  in  fallout  rates  observed  between  plastic 
encapsulated  and  hermetic  devices,  there  is  reason  to  believe  that  the  long  term  failure  rate  for 
nonheimetic  devices,  under  high  humidity  and  high  temperature  conditions  would  be  worse  than 
hermetic  packages  due  to  moisture  penetration  and  corrosion.  To  account  for  this  effect  the 
literature  was  reviewed  for  models  r  suing  failure  rate  and  variables  such  as  relative  humidity, 
tempernmte  and  power  dissipation.  Numerous  models  have  been  proposed  (References  15, 16, 
17, 18, 19, 20  and  22),  but  the  one  used  in  this  model  to  account  for  nonhermctic  package  types  is 
given  in  Reference  10,  where  the  effects  of  ambient  relative  humidity,  temperature,  and  the 
effective  chip  humidity  have  been  modeled. 
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The  time  to  failure  distribution  for  the  corrosion  associated  with  moisture  is  modeled  with  a 
lognormal  distribution  and,  per  Reference  10,  has  a  mean  life  of: 

All  296 

t50  =  AeET  eRHjEFF 


where  AH  is  the  activation  energy  (.2  eV)  and  RHgpp  is  the  effective  relative  humidity. 

If  the  duty  cycle  is  not  1.00,  the  average  effective  RHgpp  must  be  used  in  the  i$q  equation. 
Calculating  this  average  value  as  a  function  of  the  junction  and  ambient  RH’s  yields: 

RHgpp  «  DC  RHepp  (op)  +  (1  -  DC)  RHeff  (dor) 


where; 

DC  -  duty  cycle  (%  operating  time) 

RH  -  relative  humidity  of  the  environment 
RHeff  (op)  ^operating  effective  RH 
RHeff  (dor)  »  dormant  effective  RH 

»  normalizing  temperature 
Tj  j  »  operating  junction  temperature 
Tjj  “  nonoperating  junction  temperature  (Tj2  =  TA) 

RHgpp  « (DC)  (RH)  e5230  ’  TS?  +  0  -  DC)  (RH) 

Oj«TA+eJAp) 
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Normalizing  the  temperature  factor  to  25°C  and  calculating  the  A  constant  according  to  the 
tjo  values  from  Reference  10,  yields  the  following  t5Q  expression  of  the  lognormal  distribution: 

.2  ,  1  1  2.96 

t5Q  =  .000086  e  8  63x105  ^  c&Heff 

where  tjg  is  in  106  hrs.  and  (0  <  RHgpp  <  1). 

It  is  well  understood  that  there  are  typically  large  differences  between  manufacturers  (and 
even  large  variations  within  manufacturers)  regarding  the  quality  and  reliability  of  the  plastic 
encapsulate  material.  However,  as  with  the  other  wearout  mechanisms  modeled,  the  lognormal 
expression  provides  an  estimate  of  the  time  at  which  the  failure  rate  can  be  expected  to  increase 
and  the  end  of  life  may  be  approaching. 


5.3.7  ^grifviag  Cacfcigg,EaUure  ilaiaiwith  Tcmncratmc  Cycling  Data 

Sines  the  data  collected  for  this  effort  indicates  that  package  related  failures  are  primarily 
accelerated  by  tcanperature,  an  exercise  was  undertaken  to  correlate  the  package  failure  rate  with  tire 
number  of  temperature  cycles  the  device  has  been  exposed  to.  Although  the  results  from  this 
exercise  will  not  be  used  in  the  final  model,  it  was  useful  to  observe  the  degree  of  correlation 
which  existed  in  the  rer  fs  of  these  two  methodologies. 

To  accomplish  this,  instead  of  time  being  the  independent  variable,  the  number  of 
temperature  cycles  was  used  and  the  method  of  modeling  the  early  life  failure  rate  mechanisms 
outlined  in  Section  3.0  was  used.  Also,  instead  of  temperature  and  duty  cycle  being  the 
acceleration  factors,  the  temperature  cycling  rate  was  used. 
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TABLE  5-24: 

AIRCRAFT  UTILIZATION  DATA 


FHRS/ 

Sortie 


Landings/ 

Month 


FHRS/ 

Month 


A7D 

YA7D 

*A7 


14.95 

22.81 

14.98 


14.97 

14.17 

14.96 


22.91 

20.20 

22.89 


A10A 

*A10 


17.23 

17.23 


H.21 

17.21 


3131 

31.31 


A37R 

0A37B 

*A37 


17.20 

19.63 

17.78 


14.36 

8.91 

13.28 


18.64 

11.74 

17.28 


B52G 

B52H 

*B52 


12.72 

10.89 

7.73 


31.64 

31.11 

20.50 


FB111A 
♦FBI  11 


12.78 

12.78 


21.84 

21.84 


C5A 

*C5 


36.43 

36.43 


11.44 

11.44 


56.61 

56.61 


C130A 

C130B 

C130D 

C130E 

C130H 

*030 


32.04 

39.58 

27.79 

51.80 

52.59 
42.25 


12.27 

15.22 

14.83 

21.89 

24.33 

17.45 


27.31 

34.52 

26.94 

51.09 

61.06 

42.44 


♦Indicates  AN/ARN-118  RIW  data. 
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TABLE  5-24: 

AIRCRAFT  UTILIZATION  DATA  (CGNTD) 


C141A 

3.46 

55.66 

25.10 

86.83 

N0141A 

2.48 

20.92 

6.61 

16.59 

Y0141B 

1.60 

18.17 

9.00 

14.18 

*C141 

3.46 

54.28 

24.34 

84.05 

KC135A 

4.13 

19.59 

6.51 

26.85 

NKC135A 

2.76 

15.73 

5.76 

15.98 

KC1350 

4.60 

17.89 

6.68 

30,68 

">£135 

4.36 

19.90 

6.86 

29.85 

F4C 

1.30 

14.27 

11.40 

14.86 

RF4C 

1.53 

17.27 

13.43 

20.64 

F4D 

1.30 

16.76 

13.44 

19.54 

F4E 

1.29 

'  17.50 

15.14 

19.54 

F4G 

1.33 

15.04 

14.68 

19.40 

*F4 

1.34 

16.66 

13.76 

18.48 

F5E 

0.97 

25.60 

25.50 

24.59 

*F5 

0.98 

28.34 

25.02 

24.49 

ri5A 

1.30 

15.05 

15.00 

19.54 

F15B 

1.36 

31.15 

17.36 

23.63 

FISC 

1.42 

15.91 

15.54 

21.90 

F15D 

1.40 

13.41 

12.85 

17.73 

*F15 

1.32 

17.31 

i; 

15.45 

20.45 

F16A 

1.26 

15.28 

14.74 

18.6 

17.43 

F16B 

1.24 

28.52 

14.04 

*P16 

1.26 

20.95 

14.46 

18.20 

TABLE  5-24: 

AIRCRAFT  UTILIZATION  DATA  (CONTD) 


Wm 

i 

Sorties 

Month 

'  £hrs/ 

Month 

Type 

F111A 

2.32 

11.75 

6.95 

16.10 

EF111A 

3.21 

9.25 

9.25 

28.68 

F111D 

2.24 

11.97 

8.07 

18.07 

FI  11E 

2.49 

8.08 

7.27 

17.88 

F111F 

2.49 

8.28 

7.27 

18.13 

♦Fill 

2.36 

10  33 

7.37 

17.48 

T37B 

1.27 

84.72 

23.97 

30.51 

*T37 

1.27 

84.72 

23.97 

30.51 

T38A 

1.22 

77.48 

22.87 

28.00 

*T38 

1.21 

73.84 

22.89 

27.57 

^Indicates  AN/ARN-IiB  RIW  data. 


Thk  cycling  rate  data  yielded  of  Figure  5-2  i  an  approximate  minion  cycling  rm  of: 

3  cycles  cycles 

saras: 

From  the  aircraft  utilisation  data,  a  typical  fights  will  Jiave  a  duty  cycle  of: 


DC 


20  flight  hrs.  per  month 


DC  » .027 
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Next,  a  relationship  was  derived  between  the  number  of  temperature  cycles  and  failure  rate. 
The  same  method  used  to  develop  the  other  early  life  failure  rates  was  used  (outlined  in  Section 
3.0).  Table  5-25  summarizes  the  data  used.  The  first  column  lists  the  number  of  cycles  interval  at 
which  the  population  of  devices  were  tested  for  failure,  the  second  column  lists  the  total  number  of 
part  cycles  in  that  interval  and  the  last  column  lists  the  number  of  observed  failures  in  that  interval. 
Table  5-26  summarizes  the  data  used  to  derive  this  relationship. 

TABLE  5-25: 

PACKAGE  DATA  AS  A  FUNCTION 
OF  TEMPERATURE  CYCLING 


Cycles  (10^  Cycles) 

Part  Cycles 

Number  Failed 

0-0.000100 

0.762800 

2 

.000101  -  0.000300 

1.451700 

31 

.000301  -  0.000500 

1.379606 

90 

.000501  -  0.001000 

3.190000 

91 

TABLE  5-26: 

PACKAGE  SUMMARY  DATA 


#  Cycles  Range 

Midpoint 

Part  Cycles 

Failure  Rate 

.  (U$  cycles) 

.  (106) 

(10®) 

Failures 

(F/106  Cycles) 

0  -  .000500 

000250 

3.5941 

123 

34.2 

.006500  ".001000 

.000750 

3,1990 

91 

28.2 

Recessing  m  thk  data  yields  the  following  relationship: 

X  <#  cyeks) »  37  e  ’365  *#  v>Ues) 
(where  #  cycles  is  in  IQ**  cycles) 


12? 


In  the  case  of  the  package  failure  rate,  a  temperature  acceleration  factor  is  not  applicable, 
rather  it  is  the  number  of  temperature  cycles  the  device  has  been  exposed  to  that  is  important 
Therefore,  the  temperature  acceleration  factor  can  be  replaced  by  a  duty  cycle  which  is  the 
percentage  of  time  the  device  is  being  cycled  at  a  rate  of  .9  cyc^lir.  As  opposed  to  the  other 
models  for  X,ox  etc.,  in  this  model  the  package  failure  rate  is  a  worst  case  value  that 

improves  with  a  decreasing  duty  cycle.  Although  it  is  recognized  that  failures  due  to  temperature 
cycling  are  primarily  wearout  related,  the  model  was  fit  to  data  from  a  wide  variety  of  devices, 
which  tend  to  "smooth  out"  the  failure  rate  function. 


Adding  the  effects  of  screening  and  duty  cycle,  and  converting  number  of  cycles  to  time 
(using  .9  cycles  per  hour)  yields  the  following; 


.365  (#  cycles) 


where:  F  =*  Number  of  Failures 

^  «s  Cycles  per  hour 

DC  ®  Duty  Cycle 


For  a  typical  duty  cycle  factor  of  .027,  the  initial  failure  rate  would  be  .90.  This  failure  rate 
is  somewhat  pessimistic  because  the  temperature  extremes  observed  in  tire  test  data  from  which  it 
was  derived  is  greater  than  the  actual  temperature  extremes  associated  with  an  uninhabited  fighter 
environment  The  lower  failure  rare  yielded  from  tire  package  factor  presented  previously  were 
developed  from  actual  field  data,  representative  of  a  lower  temperature  extreme  cycling  situation. 
Decreasing  the  temperature  extremes  could  easily  decrease  tire  failure  rate  by  an  order  of 
magnitude.  Also,  the  data  used  to  derive  this  relationship  1$  from  a  combination  of  DIFs,  Pin  Grid 
Arrays,  and  Chip  Carriers,  the  mixture  of  which  is  inherently  less  reliable  than  the  DIP  data  used 
previously.  However,  this  exercise  did  provide  an  expected  worst  case  failure  rate  which 
compares  favorably  with  the  package  failure  rate  equation  that  will  be  used  in  this  model. 
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A  summary  of  the  package  failure  rate  is  therefore  given  as  follows  and  the  values  for  these 
factors  in  the  following  tables, 

XpAc  =  (.0024  +  1.85  x  10-5  (#Pins))  nE  Kgp  Kpj  +  XPH 

Application  Environment  Factors  (Tig) 


Environment 

nE 

Environment 

°E 

.52 

aIB 

gMS 

.88 

aIA 

Gf 

3.4 

Ajp 

gM 

5.7 

AUC 

Mp 

5.2 

aUT 

nsb 

5.4 

aUB 

NS 

5.4 

aua 

% 

7.7 

auf 

% 

8.0 

SSF 

NUU 

8.6 

Mpp 

%W 

12 

mfa 

AIC 

3.4 

Ml 

Arr 

4.0 

cl 

6.8 

5.4 


8.1 

4.0 

5.4 

10 

8.1 

12 

1.2 

5.3 

15 

1? 

300 


Package  Screening  Factor  (11^) 


Ser»n  Level 

Screen  Class 

nSP  1 

No  Screening 

D 

10 

Bma-In 

- 

8.0 

Environmental 

- 

2.8 

Burn-In/Eiivironmentai 

B 

1.0 

Package  Type  Factor  (IIpj.) 


Package  Type 

7tpT 

DIP 

1.0 

Pin  Grid  Array 

2.2 

Chip  Carrier 

4.7 

s*  Package  Hcm^ticity  Factor 
«  0  fiar  Heaastic  Packages 

X»u  sl^exp  F (  Ut(t)  4*  ll 


399  f  *  1  /  \21 , 

sa  exp  j  ^(0  *  ^SOpHV  j  nanlienuede  packages 

’  wj  °*PK’ 


Ta  -  Ambient  Temp. 


im&p^iDcxmc- 


,5230  i 


&■£) 


+  (1*DC)(RH) 


($ar  example,  for  50 %  Relative  Humidity,  use  RH  «  50) 
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5.6  ELECTRICAL  OVERSTRESS  FAILURE  RATE 


The  occurrence  of  a  catastrophic  electrical  overstress  (EOS)  event  is  an  event  related  failure 
mechanism  since  it  is  the  result  of  an  externally  supplied  voltage  or  current.  Since  it  is  event 
related  and  not  an  inherent  reliability  failure  mechanism,  it  is  independent  of  time  and  dependent 
only  on  the  probability  of  an  EOS  event,  the  magnitude  of  the  overstressing  voltage  or  current,  and 
the  susceptibility  of  the  device  to  damage.  It  can  therefore  be  modeled  as  a  constant  failure  rate  as  a 
function  of  susceptibility  level. 

Although  there  are  many  types  of  EOS  sources,  each  with  their  own  characteristics,  the  best 
source  of  susceptibility  information  (and  in  most  cases  the  only  source)  is  the  tests  specified  in 
MIL-STD-883,  Method  3015.  Although  this  is  only  one  measure  of  the  EOS  susceptibility  of  a 
device,  it  is  the  most  readily  available  and  therefore  will  be  used  as  an  input  into  the  model.  The 
assumption  in  using  this  approach  is  that  the  EOS  and  ESD  susceptibility  levels  are  highly 
correlated. 


Although  it  is  also  recognized  that  die  electrical  environment  in  which  die  device  is  deployed 
is  a  primary  factor  in  the  EOS  failure  rate,  it  cannot  be  used  in  the  model  since  users  of  die  model 
often  have  no  control  of  Use  final  environment  and  do  not  know  the  EOS  characteristics  of  it.  In 
some  cases,  the  relative  EOS  severity  between  environments  can  be  defined.  For  example,  there 
appears  to  be  more  EOS  failures  from  devices  in  avionics  equipment,  where  power  quality  is 
always  a  concern.  However,  to  quantify  the  magnitude  of  severity  levels  as  a  function  of 
enviixmment  would  be  unduly  complex  and  is  beyond  die  scope  of  this  study.  Therefore  the 
objective  in  deriving  an  EOS  factor  is  that  a  “typical"  EOS  failure  rate  be  derived  only  as  a  function 
of  tire  ESD  susceptibility  of  rite  device.  By  modeling  the  EOS  failure  rate  in  tins  manner,  it  is 
essentially  being  treated  as  an  environmental  stress. 

The  basic  premise  for  this  factor  is  die  following  relationship: 

P(0-P(fyc)P(c) 
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where: 


P(f)  is  the  probability  of  field  failure  due  to  Electrostatic  Discharge  or  Electrical  Overstress 

P(f/c)  is  the  probability  of  failure  given  that  the  device  has  been  contacted  by  the  EOS/ESD 

source 

P(c)  is  the  probability  of  contacting  the  device  with  the  EOS/ESD  source 

To  derive  the  probability  of  failure  due  to  an  EOS/ESD  event,  an  EOS/ESD  modal  failure  rate 
was  developed  by  deriving  a  representative  failure  rate  for  integrated  circuits  (for  a  variety  of 
device  types  and  environments)  and  multiplying  this  failure  rate  by  the  percentage  of  failures 
observed  to  occur  as  a  result  of  electrical  overstress.  These  values  were  derived  from  the 
Reliability  Analysis  Center's  data,  which  indicates  that  a  total  of  7948  failures  were  observed  in  a 
total  of  16116  million  part  hours  (yielding  a  failure  rate  of  .493).  Also  from  the  RAC  failure 
analysis  database  (Reference  75),  8.S  percent  of  all  failures  in  the  database  were  due  to  EOS/ESD, 
indicating  that  an  EOS/ESD  modal  failure  rate  of  .0419  (F/10^).  Therefore,  under  typical 
conditions  in  one  year  of  operation,  the  probability  of  failure  (P(0)  due  to  BOo/ESD  is: 

l^f)«l-o4ESDt«,00D367 

To  derive  a  probability  of  failure  given  the  device  has  been  contacted  with  an  EOS/ESD  pulse 
IWc),  data  (from  Reference  77)  was  used  which  identifies  the  failure  voltage  distribution  for  all 
microcircuits.  This  distribution  is  lognormal  with  a  mean  of  sppiodmateiy  2209  volts.  The 
population  from  which  this  distribution  was  obtained  and  die  poptiadoa  from  which  the  EOS/ESD 
modal  failure  rate  was  calculated  should  be  very  similar  since  they  are  representative  of  a  good 
cross-scctioa  of  device  types,  technologies,  and  operational  environments. 

from  data  available  m  the  Ihmtwe  (Reference  1)  the  voltage  distributions  given  in  Table  5- 
27  were  defined  for  the  stressing  voltage  as  a  function  of  die  level  of  ESD  protection  in  a  given 
area  (based  on  a  normal  distribution).  The  average  listed  in  the  table  is  the  distribution  used  to 
represent  the  voltages  present  in  the  environment  of  the  devices  from  which  the  EOS/ESD  modal 
failure  rate  was  derived. 
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TABLE  5-27: 

ESD  SOURCE  VOLTAGE  DISTRIBUTIONS 


ESD 

Protected 


ESD 

Unprotected 


Avcra 


Mean 

Std.  Deviation 


It  is  reasonable  to  assume  that  the  stress  voltage  distribution  is  not  normally  distributed  as 
previously  presumed  (Reference  1)  but  rather  lognormally  or  exponentially  distributed.  As 
illustrated  in  Figure  5-22,  an  exponential  distribution  is  intuitively  appealing  since  the  probability 
of  having  a  given  voltage  present  in  a  particular  situation  increases  with  decreasing  voltage. 


Mean  »  4500  V 


Voltage  Level 


FIGURE  5-22: 

STRESS  VOLTAGE  DISTRIBUTION 


Assuming  the  exponential  distribution  and  using  the  mean  voltage  for  the  normal  of 4500  V 

1 

and  e  » this  to  the  mean  of  an  exponential  (6 = —)  yields  a  9  of  .0002  and  the  following  stress 

At 


voltage  distribution: 


(V  STRESS  =  *0002  ^ 


Calculating  the  contact  rate: 


for  t  =  .00876  (xlO6  hrs.) 


P(f) »  1  -  e*^E0S  1  «  1  -  .999633  « .000367 


(from  empirical  data  XgQj  » ,0419  @  t « .00S76  x  10^  hrs.) 


Assuming  a  mean  threshold  voltage  of 2200  volts: 

2200  V  -mm  Vyh 
j  .0002  c  ™dV 

as  ,§44 


where; 


Wm  *  ESD  Voltage  UtreshoM 


/.  ,  r.  -.0002  VthI  -.0002  Vth'I 

Via  general  Pff/c) »  I-|_l  J  *  -e  / 


m  wI§|y“^llr  *  .00us? 
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^l^i=  065  contacts 
^  t  in< 


10°  hrs. 


tAEOS  1 3  i  (-In  (1  -  P({/c)  (.00057)) 
P(tfc)  -  -e*-0002  Vth 


Therefore, 


''EOS 


-In  (1  -  .00057  e'-0002  VTH)  6 

- S - -^fg - 2  rntfi  to-) 


A  graph  relating  Xggg  to  ESD  susceptibility  level  is  given  in  Figure  5-23. 


HOUSE  S- 23:  W' AS  A  FUNCTION  OF  SUSCEPT1BIUTY 


5 .7  MISCELLANEOUS  FAILURE  RATE 


A  miscellaneous  failure  mechanism  category  was  defined  since  there  were  various  failure 
causes  in  the  database  which  could  not  be  categorized  into  one  of  the  previously  defined  early  life 
failure  mechanisms.  Rather  than  separating  each  of  these  miscellaneous  mechanisms  into  a 
separate  failure  rate  which  would  make  the  model  more  complex,  they  were  summarized  into  one 
category  which  includes  various  failure  mechanisms  relating  to  the  assembly  and  time  dependent 
package  failures  causes.  This  miscellaneous  failure  rate  is  intended  to  be  stricktly  an  emperical 
relationship  used  only  to  allow  the  predicted  failure  rates  to  be  as  close  to  the  observed  data  as 
possible. 

The  package  related  failure  rate  derived  previously  was  based  only  on  failures  induced  by 
temperature  cycling.  There  were  a  small  percentage  of  package  failures  that  occurred  as  a  result  of 
accelerated  operational  life  testing.  These  failures,  since  they  occurred  during  life  tests  are 
considered  time  dependent  and  are  included  in  this  miscellaneous  category. 

To  derive  this  factor,  the  method  (outiiw?d  in  Section  3.0)  used  for  the  other  early  life  failure 
rates  was  used.  Since  it  was  not  a  single  mechanism  being  modeled  an  equivalent  activation 
energy  had  to  be  derived.  This  was  accomplished  by  weighting  die  acceleration  due  to  temperature 
for  each  mechanism  in  accordance  will!  the  number  of  failures  for  that  mechanism  (Reference  7). 
Figure  5*24  illustrates  tills  concept. 


a 

b. 


2S°C 


TottpefaUirc 


—f - 

125  °C 


> 


FIGURE  5-24:  HYPOTHETICAL  MIXTURE  uF  FAILURE  RATES 
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Since  the  database  contains  primarily  125°C  life  test  data,  the  number  of  failures  at  25°C  was 
projected  for  each  failure  mechanism  based  on  the  observed  number  of  failures  at  125°C.  The 
equivalent  activation  energy  was  then  calculated  based  on  the  weighted  and  combined  individual 
accelerations.  This  activation  energy  wac  fou^d  to  be  .423  eV,  and  will  be  the  one  used  to 
calculate  the  equivalent  part  hours  for  derivation  of  the  miscellaneous  early  life  failure  rate.  To 
derive  this  factor,  the  data  in  Table  5-28  was  used.  The  summarized  data  used  in  derivation  of  this 
factor  is  given  in  Table  "  -29: 


TABLE  5-28: 

FAILURE  DATA  FOR  MISCELLANEOUS  FAILURE  RATE 


f— - - — ' - — - " 

Effective  Time 

Interval  (10**  hrs.) 

Effective 

Part  Hours  (10**  hrs.) 

Number  of 

Failures 

0-0.011543 

3175.006037 

2 

.011544  -  0.020183 

2372.378700 

1 

.020184  -  0.040402 

5501.966794 

42 

.040403  -  0.066083 

4944.568256 

1 

.066084  -  0.070642 

868.808198 

1 

.070643  -  0.082514 

2198.587890 

2 

.082515  -  0.112394 

5498.847131 

1 

.112395-0.120244 

1430.374353 

6 

.120245  -  0.210243 

15242.040931 

15 

.210244  -  0.240488 

5036.427332 

4 

.240489  -  0.242412 

29.643048 

5 

.242413  -  0.355239 

1255.910793 

1 

.355240  -  0.393352 

458.155979 

1 

.393353  -  0.420487 

271.892485 

1 

.420488  -  1.510862 

200.857977 

4 

TABLE  5-29: 

SUMMARIZED  MISCELLANEOUS  FAILURE  DATA 


Effective  111116 

Failure 

Interval 

Part  Hours 

Number 

Rate 

(10^  hrs.) 

KESi 

■SB 

of  Failures 

(F/106  hrs.) 

0-  .040402 

.020201 

11049 

45 

.00407 

.040403  -  1.5 10862 

.73520 

37434 

42 

.00112 

This  data  yielded  the  following  relationship: 


.0042  e'^.2 1 


Accounting  for  unknown  failures  and  adding  the  effects  of  acceleration  due  to  temperature 
and  screening  yields: 

Xkus  =  (.01  e'2-2 '»)  (ATmis)  (e‘2-2  ATM1S') 
where  ^Tmis  53  Accdemtion  Factor 

T  -.423  /l  1  Y1 

■  “HisnFivwJJ 

tQ  «  Effective  Screening  Tune 

»  ^  Screening  Temp.)  *  Actual  Scieeaing  Time  (in  10^  hours) 

t  o  time  (in  10^  hrs.) 
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6.0  DETAILED  MODEL  SUMMARY 


This  section  summarizes  the  detailed  model  in  its  entirety.  Table  6-1  summarizes  the  input 
parameters  of  the  detailed  model  and  their  default  values.  The  default  values  are  either  averages  of 
the  values  contained  in  the  database  or  "typical”  values. 


TABLE  6-1: 
INPUT  PARAMETERS 


VARIABLE 

UNITS 

DEFAULT 

Die  Area 

cm2 

.21  cm2 

Device  Type 

* 

1)  Custom  and  Logic 

2)  Memory  and  Gate  Array 

Defect  Density 

Def/cm2 

1 

Feature  Size 

Micron 

2 

Power  (Actual)  (P) 

Watt 

* 

Theta-  JA  (0JA) 

°C/watt 

(See  Table  5-23) 

Theta- JC  (0JA) 

°C 

- 

Ambient  Temperature  (TA) 

°c 

* 

Screening  Duration 

10^  hours 

* 

Screening  Temperature 

°C 

* 

Screening  Power  Dissipated 

watt 

* 

Current  Density  in  Metal  (J) 

10^  X  /cm2 

.5 

Electric  Field  in  Oxide  (Eox) 

MV/cm 

2 

Substrate  Current  (Isub) 

mA 

.0058  e  '-00689 ' 

Drain  Current  (1^) 

mA 

3.5  e  '-00157  TJ 

Proven  Mfg.  Process  (on  QML?) 

* 

1)  yes 

2)  No 

Sigma  Oxide  (Oqx) 

1 

Sigma  Metal  (a^-p) 

- 

.5 

Sigma  Hot  Carriers  (c^) 

- 

1.1 

Package  Type 

- 

* 

Pin  Count  (NP) 

- 

* 

Environmental  Screens  Applied 

* 

'139 

* 

TABLE  6-1: 

INPUT  PARAMETERS  (CONTD) 


Application  Environment 

1)  None 

2)  Bum-In 

3)  Environmental 

4)  Bum-In/Environmental 
ESD  Susceptibility  (Vra) 

Duty  Cycle  (DC) 
Correction  Factor  (tcc) 


UNITS 


Volts 


*  Needed  as  an  input  to  detailed  model 


pages. 


OXIDE  FAILURE  RATE  EQUATION 


1  r  n/in6\  A  ATYPEox  fD0OX 
XQX  (in  F/10°)  =  - 


'R 


% 


[(.0788  e  '7‘7 


t0,  ,A  , -7.7  AToxt 


)  (atox)  (e 


+  --399---cxV(-~Qn  (t  +  t0)  -  In  t5oox)2>|] 


(t+to)aox 


A  =  Total  Chip  Area  (in  cm^)  (typical  values  for  this  area  range  approximately 

from  .1  to  lcm^) 

AtypEqx.  =  -77  for  Custom  and  Logic  Devices 


=  1 .23  for  Memories  and  Gate  Arrays 

Ar  =  .21  cm^ 


°0ox 

% 

to 


Defect  Density  (If  unknown,  use 


Xof 


v.  v 


where  Xq  =  2  }im  and  Xs  is  the  feature 


size  of  the  device) 

1  Defect/cm^ 

Effective  Screening  ’lime 


(Actual  Time  of  Test  (in  10^  hrs.))  *  (ATqx  (at  junction  screening  temp.) 
(in  °K))*  (Ayox  (at  screening  voltage)) 
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OXIDE  FAILURE  RATH  EQUATION  (CONTINUED) 


Temperature  Acceleration  Factor 


r  ~3 

f1 

IV] 

.8.63xl0-5 

K 

"  558) 

where  Tj  =  Tc  +  0j^P  (in  °K) 

=  e-192  (rb  -  5T> 

Eox  =  Maximum  Power  Supply  Voltage  VDD,  divided  by  the  gate  oxide  thickness 
(in  MV/cm) 

=  13Zf2£ML)  (i-^hxs.) 

^OX  AVox 

(QML)  =  2  if  on  QML,  .5  if  not 

=  Sigma  obtained  from  test  data  of  oxide  failures  from  the  same  or  similar 
process.  If  not  available,  use  a  cQx  value  of  1. 


Time  (in  106  Hours) 


fAAjypg  D0met  _-1.18  t^/A 

W  =  [ - Aj£ - D^  (-00102  e  °><ATMET  )(C 


-1.18  A  t, 
TMET  ) 


.399  (  -.5  ^ 

<^^Txp  (  “ '  50met) 


atype 


Total  Chip  Area  (in  car)  (typical  values  for  this  area  range  approximately 
from  .1  to  1  cm^) 

.88  for  Custom  and  Logic  Devices 


=  1 . 1 2  for  Memory  and  Gate  Arrays 


.21  cm'1 


)qmet  =  Defect  Density  as  Calculated  in  Appendix  B  (If  unknown  use  (jr-) 

S 

where  Xq  =  2  |im  and  Xj  is  the  feature  size  of  the  device) 


1  Defect/cm^ 


AI}VIEr 


Temperature  Acceleration  Factor 


-•55  (±  _  1  V] 

.63xl0'5  [TJ 


TJ  -  tcase  +  ®JC^  °K) 

=  Effective  Screening  Time  (in  10^  hrs.) 


A_  (at  Screening  Temp,  (in  °K))  *  (Actual  Screening  Time  (in  10°  hrs)) 
‘MET 

(to  Calculate  tQ  use  A.^  based  on  the  junction  temperature  during  screening) 


IfllSWfiJ 


I  »JJ  tt  3  JX  >  3  ?•!»/;*  I  W.C1CMJCM*  )CLl}aH 


(QML)  '388  VMetal  Type>  (in  106  hrs.) 

J%er 

(QML)  =  2  if  on  QML,  .5  if  not 

Metal  Type  =  1  for  A1 

37.5  for  Al-Cu  or  for  Al-Si-Cu 

J  *  The  mean  absolute  value  of  Metal  Current  Density 
(in  10^  Amps/cm^) 

sigma  obtained  from  teat  daft  on  elcetromigration  failures  from  tire  same 
or  a  similar  process.  If  tltis  to  is  not  available  use  »  1 


dine  (in  10^  hrs.) 


RQI.CMRIER  ^MJJR^.RATE.EQ^ATIO^ 


^HC 


.399 

«hc 


“P[^?(1°<t+t0)'lnt50Hc)2] 


^Ohc 


(QML)3.74xlO*5 

fisuh} 

Ay  ] 
*HC 

d 

d  J 

(QML) »  2  if  on  QML,  5  if  not 

where  T}  Tc  8JCP  (in  °K) 

%  s»  Dsaia  Cuixeat  at  0|miiag  Teajpcraiuic,  If  unknown  use 
IjI  « .0058  c  --00157  Tj  0*  °K)  (klA) 

*sub  te  Subshatc  Cunem  at  Opexaiing  Terapasiure.  If  unknown  use 
a3Je.. 00639  T,  (to ‘-K)(mA) 

Cjj-  w  sigsia  derive*!  iicm  rest  dam,  if  not  available  use  i.  i 

(0  *  Aj^jj  (atSateaing  Tcmp-CLa  °K»  *  (test  Duration  in  10^  horns) 


CONTAMINATIQN  FAILURE  RATE  EQUATION 


-.0028  to  A  -.0028  At  t 
=  .000022  e  At  c  CON 

1CON 

Factor 

r  -i.o  f\  i  \i 

where  Tj  «  Tc  +  6JCP  (in°K) 

»  Effective  Screening  Time 

68  ^Tcoii  ^at  SC3KCi^  junction  temperature  (in  °K»  *  (actual  screening 
time  in  iO^hts  .) 

t  »  time  (in  10^  his.) 


wamm 


CTifttjraayayTOiPJRwflQgi 


(.0024  +  1.85  x  lO'3  (#Pins))  jce  Jtgp  Jtpj-  +  XPH 
Application  Environment  Factors  (1%) 


Enviionment 


nE 

Enviionment 

nE 

.52 

AIB 

6.8 

.88 

aIA 

5.4 

3  4 

aIF 

8.1 

5.7 

AUC 

4.0 

5.2 

aut 

5.4 

5.4 

alb 

10 

5.4 

aua 

8.1 

7,7 

Ayp 

12 

8.0 

%F 

1.2 

8.6 

Mpp 

5.3 

12 

%A 

IS 

3.4 

17 

4.0 

300 

Screen 

Quality  Level* 

nSP 

No  Screening 

D 

10 

Bum-la 

- 

s.o 

Envtronmciual 

- 

2.8 

Bcm-InyEaviroau^ual 

1.0 

♦Quality  ievdasdctel  in  Table  5.U.74  ofMiL-HLBK-217. 


Package  Type  Factor  (llpj) 


Package  Type 

n^r 

DIP 

i.O 

Pin  Grid  Array 

2.2 

Chip  Carrier 

4.7 

lpH  »  Package  Henrteticity  Factor 
XpH  »  0  for  Hermetic  Packages 


*  ^22,Cxp  ( la{t)  -  totfs/w*)  V  Koahaateoe  packages 

lthM  L«ph4X  r 

'»«.  *  (i '  -*[ras^ 

«  Ambient  Temp,  (in  °K) 

SHjjf ■  (BCKRH) eS230 ('Tj' "  lj)*<WW<TO 
wiss®  Tj-  *  Tc  4-  %CP  (in  °K) 

(for  exatapic,  for  50%  Relative  Humidity,  ust  A  » .50} 
opy  **  -74 

t  *  time  (in  1$  fan  ) 
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MSCEU.4NB0mEAE.URE  RATE  EQUATION 


has  -  (-01e'2-2to)(ATMIS)(e-2-2ATMs') 

ATmb  =  Temperature  Acceleration  Factor 


r  -.423 

1  si 

„8.63xl0"^ 

C’T' 

M]| 

where  Tj  =  Tc  +  eJCP  (in°K) 
to  »  Effective  Screening  Tune 

=  A^^  (at  Screening  Temp,  (in  °K))  *  Actual  Screening  Time  (in  10^  hours) 
t  =  time  (in  10^  hrs.) 
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7.0  SHORT  FORM  MODEL 


7.1  DERIVATION  METHODOLOGY 


The  detailed  model  presented  previously  is  intended  to  model  VHSIC  CMOS  failure  rates  as 
accurately  as  possible.  As  discussed  in  the  introduction  of  this  report,  it  is  recognized  that  there  is 
a  need  for  a  much  simplified  model,  both  in  the  data  required  for  calculation  and  in  the  complexity 
of  the  calculations  themselves. 


To  accomplish  this,  a  short  form  model  was  derived  from  the  detailed  version.  The 
following  model  form  was  chosen  which  has  additive  failure  rates  for  die  related  failure 
mechanisms,  package  related  failure  mechanisms,  and  electrical  overstress;  each  multiplied  by  the 
appropriate  correction  factors: 

Xp  =  ^BD5tMFG'tTTtSD7tCD  +  ^BP^E^SP^PT  +  ^EOS 

where: 


^BD 

%!FG 

fiy 

asr> 

XCD 

xQP 

% 

itSP 

%T 

Xeos 


is  tlje  base  failure  rate  forth®  die 

is  the  correction  factor  based  on  tits  manufacturing  process 

is  the  die  temperaur  s  factor 

is  the  die  screeniitg  factor 

is  the  die  complexity  factor 

is  the  package  base  failure  rate 

is  the  mvironment  factor 

is  rite  package  screening  factor 

is  the  package  type  factor 

is  the  failure  rate  due  to  electrical  overstress 


The  derivation  of  each  of  these  factors  is  summarized  as  follows. 
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7.1.1  Temperature.  Factor 


To  develop  a  temperature  acceleration  factor  for  the  short  form  model,  an  average 
"equivalent"  activation  energy  had  to  be  derived,  recognizing  that  a  single  activation  energy  is  an 
approximation  and  only  applicable  for  a  single  failure  mechanism.  To  accomplish  this,  the  detailed 
model  was  exercised  for  a  variety  of  conditions  and  various  temperatures  and  an  average  activation 
energy  calculated  from  the  resulting  failure  rate  ratios  at  various  temperatures.  As  with  the 
individual  failure  mechanisms,  the  following  temperature  acceleration  factor  form  was  used: 


-Ea 
%  =  eK 


where: 

Ea  is  the  equivalent  activation  energy 

K  is  the  Boltman's  Constant 
Tj  is  the  junction  temperature 

Tj  is  a  reference  temperature 


Calculating  an  average  Ea  yielded  a  value  of  .33  eV.  Therefore,  normalizing  the  acceleration 
factor  to  298  degrees  Kelvin,  the  temperature  acceleration  facto*  becomes: 

>-.33  f\  1  \ 

^eSoO^TT  Wj 


7.1.2  gl&tosaifl&raQhfl 

Tite  overall  effects  of  bum-in  can  also  be  determined  from  the  detailed  model  by  knowing  the 
duration  and  the  junction  temperature  of  the  burn-in.  Since  the  detailed  model  has  a  decreasing 
failure  rate  for  the  defect  related  early  life  failure  mechanisms,  the  effects  of  bum-in  can  easily  be 
determined  by  exercising  the  model  for  various  bum-in  lengths  and  temperatures.  The  failure  rate 
improvement  after  the  bum-in  can  then  be  determined.  The  lengths  and  temperatures  chosen  were 
those  of  Class  D,  B  and  S  (per  MIL-HDBK-217)  quality  level  devices.  For  these  devices,  the 
following  relative  bum  in  factors  were  derived  (normalizing  Class  D  (no  bum-in)  to  1 . 
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TABLE  7-1: 

DIE  SCREENING  FACTOR 


Class 

WEH5M 

D 

i 

B 

.94 

S 

.85 

*QML  parts  will  initially  be  exposed  to  Class  B  screening  as  a  baseline.  Further  refinement  of  this 
factor  will  be  made  as  data  becomes  available. 

7.1.3  DkjCgmplctity  Factor 

The  die  complexity  factor  in  the  detailed  model  was  a  function  of  die  area  and  defect  density. 
The  area  acceleration  factor  was  (Area  (in  cm^))J21 ,  and  the  default  defect  density  value  to  be  used 
in  the  short  form  model  is: 


where:  Xs  ■  feature  size 

The  die  complexity  factor  (11^  is  therefore. 


v 


where 


A  *  Area  in  cm? 

Xs  at  Feature  size  is  microns 


which  yields  tire  relative  complexity  factors  in  Table  7-2,  as  a  function  of  chip  area  and  feature 
ska. 
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TABLE  7-2: 

DIE  COMPLEXITY  FACTOR 
Chip  Area  (car) 


Sias  (Microns) 

.1-2 

.2-.4 

.4-1.0 

1.0-2 

2-3 

1.00 

2.8 

5.7 

13 

28 

47  1 

1.25 

1.9 

3.9 

9.0 

19 

32 

1.50 

1.3 

2.5 

5.9 

13 

21 

2.00 

.71 

1.4 

3.3 

7.1 

12 

2.50 

.45 

.92 

2,1 

4.6 

7.6 

3.00 

.31 

.63 

1.5 

3.1 

5.2 

However,  since  only  the  metal  and  oxide  failure  rates  are  accelerated  with  the  feature  size/die 
area  factor,  the  factors  in  Table  7.1-2  must  be  decreased  accordingly.  From  the  life  test  data 
presented  in  Table  4-2  (excluding  unknown,  assembly  and  package  failures),  64%  of  die  failures 
are  due  to  metal  and  oxide  (173  failures  out  of 272)  and  therefore  the  actual .  should  be: 


*c“[2r(^)  (;64)]  +'36 


which  yields  the  %q  values  in  Table  7-3. 


TABLE  7-3: 

SHOUT  FORM  MODEL  DIE  COMPLEXITY  FACTOR 


Since  the  detailed  model  assumes  that  the  differences  in  manufacturing  processes  are 

accounted  for  in  the  defect  density  term  and  since  the  short  form  model  wiM  always  use  the  default 

/2  \2 

condition  for  defect  density,  j  ,  there  must  be  a  way  to  incorporate  the  effects  of  the  level  of 


control  in  a  manufacturing  process  into  the  short  form  model.  Hie  rationale  for  including  the  effect 
of  the  manufacturing  process  is  that  companies  on  the  Qualified  Manufacturers  List  (QML)  or  the 
Qualified  Products  List  (QPL)  must  have  demonstrated  to  the  qualifying  activity  that  its  technology 
lias  low  failure  rates  for  time  dependent  dielectric  breakdown,  electromigration  and  hot  carrier 
degradation.  Non  QML  manufacturing  lines  generally  do  not  demonstrate  these  low  failure  rates  in 
such  a  thorough  manner  to  outside  organizations. 


To  incorporate  the  effects  of  manufacturing  process  control,  the  1.0  and  1.2  micron  failure 
rate  data  was  used  and  a  defect  density  required  to  make  the  observed  and  predicted  failure  rates  the 
same  was  calculated.  The  mean  of  these  calculated  defect  densities  was  then  taken  for  each 
manufacturer,  and  a  8.5:1  ratio  was  observed  from  the  best  to  worst  manufacturer.  Therefore, 
assuming  this  range  in  defect  densities  based  on  the  level  of  process  control,  the  following 
categories  cart  be  defined  along  with  their  associated  defect  densities: 


Manufacturing  Process 

D 

QML  or  QPL 

Non  QML  ^  QPL 

.33 

2.5 

JBor  the  summarized  arodei,  these  numbers  must  be  scaled  in  accordance  with  the  following, 
since  only  64%  of  all  die  related  failures  are  typically  due  to  oxide  or  metal.  Table  7-4  presents  the 
final  festers. 


*MFO  m  .36 


TABLE  74: 
ie^fg  VALUES 


Manufacturing  Process 

tcMFG 

QMLorQPL 

Non  QML  or  QPL 

.55 

2.0 

7.1.S  MagfeBiilmEate 

Since  the  detailed  model  package  failure  rate  is  not  a  function  of  time  and  it  is  relatively 
simple  to  use,  it  can  essentially  be  used  as  is  in  the  short  form  model.  One  part  of  the  package 
failure  rate  model  that  was  time  dependent  and  cannot  be  used  is  the  treatment  of  plastic  packages. 
Since  the  short  form  model  cannot  be  time  dependent  another  approach  was  be  taken  to  account 
for  long  term  reliability  degradation  due  to  nonhermetic  effects.  To  accomplish  this,  the  fallout  rate 
of  the  various  package  types  was  modified  by  adding  to  them  a  typical  fallout  rate  for  plastic 
devto  exposed  to  ten A  typical  fallout  rate  for  this  test  is  .4%. 


In  previous  sections  of  this  report,  the  fallout  rates  listed  in  Table  7-5  were  presented  as  a 
function  of  package  type: 


TABLE  7-5: 

PACKAGE  TYPE  FALLOUT  RATES 


Package  Type 

FO  (%) 

Chip  Carrier 

1,03 

Pin  Grid  Array 

.49 

DIP 

.22 
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Data  in  Table  5-21,  which  presents  early  life  fallout  rate  differences  between  plastic  and 
hermetic  parts,  can  not  be  used  to  derive  their  relative  Mure  rates  since  that  data  is  representative 
of  screen  tests  used  to  detect  package  defects.  The  Pi  factor  being  developed  for  plastic  packages 
here  is  intended  to  be  representative  of  its  expected  long  term  reliability. 

Therefore,  to  account  for  plastic  encapsulation,  .4  can  be  added  to  the  fallout  rates  in  Table  7- 
5  to  account  for  long  term  nonhennetic  reliability  effects.  Therefore,  the  fallout  rates  listed  in  table 
7-6  were  used.  Based  on  these  fallout  rates,  a  Jtpj.  can  be  derived  in  the  same  manner  as  in  the 

detailed  model  and  these  values  are  summarized  in  Table  7-7. 

TABLE  7-6: 

FALLOUT  RATE  AS  A  FUNCTION  OF  PACKAGE  HERMETIOTY 


Packs 


Chip  Gamer 
Pin  Grid  Army 
DIP 


Hermetic 

FORaie 

Nonhermstic 

1.03 

1.43 

.49 

.89 

.22 

.62 

TABLE  7-7: 

SHORT  FORM  PACKAGE  TYPE  FACTOR 


Chip  Carrier 
Pin  Grid  Army 
DIP 


TCpr 

Hermetic 

Nonhcnnctic 

4.68 

6.50 

2.23 

4.05 

1 

2.82 

159 


Package  Type  Correction  Factor  (Upy) 


Package  Type 

IlpT 

Hermetic  Nonhermctic 

DIP 

Pin  Grid  Array 

Chip  Carrier 

1.0  2.8 

2.2  4.0 

4.7  6.5 

7.1.6 

To  derive  a  die  base  failure  rate  XBp  for  the  short  form  model,  the  following  procedure  was 

used: 


(1)  A  prediction  was  performed  with  tfte  detailed  model  for  a  variety  of  temperatures, 
screen  dmotious,  die  areas,  and  feature  skes, 

(2)  The  twenty  year  average  Mure  rate  was  calculated  for  cadi  of  these  oanbinatious  Xp. 


(3) 


model: 


*SD( 


hi  ^PAC 

nwn«MW*ri 


^1‘YPE%ISSD^0JSMFG 


SZ«' 


As  described  in  the  detailed  model,  the  failure  rate  is  a  function  of  the  device  type  and 
separates  ths  failure  rate  due  to  oxide  and  metal  The  A-j-ypg  correction  factors  for  Metal  and  Oxide 

therefore  should  be  weighted  (in  accordance  with  the  number  of  failures  expected  for  each)  and 
combined.  For  custom  and  logic  devices; 


Atyps  *  (.77)  +  ,36  (.88) 

».81 


/64%  oxide\ 
\36%  metal/ 


For  memories  and  gate  arrays: 

aTYP£  =  M  (1.23) +  .36  (1.12) 

*  1.19 

Since  this  A^pg  factor  only  effects  the  oxide  &  urstal  faitem  mis  the  actual  factor  should  be 
(since  6 1%  of  all  failures  me  due  to  oxide  or  metal): 

A^pgACTjAL  ^  Atypb  (.64)  +  .36 

Therefore  for  custom  and  lope  devices  Ayy  ps  .83  and  for  memories  and  gate  arrays, 

AyvpE  m  1*12. 

Combining  this  device  type  correctUm  factor  with  the-  base  teihae  rate  results  in  the  following 
base  failure  wzms.  fuacdoa  of  device  type: 


Device  Type 

*BD 

Oistom  and  Logic 
tvfeanoek s  and  Gate  Array 

.02 

.03 

?.L7 

The  ssinpteie  short  form  model  is  presented  oo  the  following  pages  in  a  format  intended  to  be 
replacement  pages  for  MIL*HDBK»217. 
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The  part  operating  predicted  failure  rate  (Xp)  is: 


Xp  =  Xbd^ifg^sdJ^t)  +  ^Bpii^nspiipY + x^os 


where: 

Xp  is  the  Device  Predicted  Failure  Rate  in  F/10^  Hours. 

Xpp  is  the  Die  Base  Failure  Rats: 

For  Logic  and  Custom  Devices,  Xgp  -  .02 
For  Memories  and  Gate  Arrays,  XgQ  -  .03 

nMH3  is  the  Manufacturing  Process  Correction  Factor,  Table  1. 

%  is  rite  Yempoauire  Accekiutiau  Factor,  Table  2. 

H$0  is  the  Dio  Screening  Correction  Factor,  Table  3. 

is  the  Die  Complexity  Corrective  Factor,  Table  4,  based  on  the  Area  (in  cm2)  and  the 
It&lure  Sire  (in  Mjoooe)  of  the  Die. 

Xgp  is  the  Package  Base  Failure  Rate: 

;  Xm  *  .0024  +  (i.SSxHr5)  (&P) 

.  where  1S1F  «*  Nuuib^cd'Package  Pins 

1^:  is  the  Environment  Facto*,  Txble  5. 

HSp  is  the  lockage  Screening  Factor,  Table  6. 

■  istfr  Package  Type  Cocrectioo  Factor,  Table  7. 

XgQ$  is  the  Failure  Rare  due  to  Electrical  Ovcrstress,  Table  8,  based  on  the  Electrostatic 
Discharge  Susceptibility  of  tire  Part,  in  Vdis. 
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TABLE  1: 

Of^pQ,  MANUFACTURING  PROCESS  CORRECTION  FACTOR 


Manufacturing  Process 

nMFG 

QMLorQPL 

Non  QML  or  Non  QPL 

.55 

2.0 

TABLED 

TEMPERATURE  ACCELERATION  FACTOR  (SEE  NOTE  BELOW) 


Tj(°C) 

“nT^1 

Tj  (°C) 

nT 

TjfC) 

nx 

Tj  (°C) 

nT 

25 

i 

5! 

2.S0 

77 

6.73 

103 

14.32 

2? 

1.09 

53 

3.01 

79 

7.16 

105 

15.12 

29 

US 

55 

3.23 

SI 

7.61 

110 

17.25 

31 

1.29 

57 

3.47 

S3 

8.09 

120 

22.24 

33 

1.40 

59 

3.72 

85 

8.59 

125 

25.13 

35 

1.52 

6! 

3.99 

87 

9.11 

135 

31.81 

3? 

1.63 

63 

4.2? 

89 

9.67 

145 

39.80 

39 

1.7  S 

65 

AM 

91 

10.24 

150 

44.35 

41 

1.92 

67 

AM 

93 

10.05 

155 

49.29 

43 

2,08 

69 

*  5.21 

95 

11.48 

165 

60.44 

45 

2.24 

71 

5.56 

97 

12.15 

175 

73.44 

47 

2.42 

73 

5.93 

99 

12.84 

49 

2.60 

75 

6.32 

101 

13.57 

where: 


Tj  is  the  worst  case  junction  temperature  (°K).  Tj  is  estimated  using  the  following 
expression: 


Tj  *  Tq  +  0j£  P  +  273 


where: 

T(2  is  case  temperature  (°C). 

@jC  is  junction  to  case  thermal  resistance  (°C)/watt)  for  a  device  soldered  into  a  printed  circuit 
board.  If  Gjq  is  not  available,  use  a  value  contained  in  a  specification  for  the  closest 
equivalent  device  or  use  the  table  on  Page  5.1.2.7-5. 

P  is  the  worst  case  power  realized  in  a  system  application.  If  the  applied  power  is  not 
available,  use  the  maximum  power  dissipation  from  the  device  specification  or  from  the 
specification  for  the  closest  equivalent  device. 

If  cannot  be  determined,  use  the  following: 


ENVIRO. 

Gb 

°MS 

Gp 

Mp 

nSB 

NS 

% 

nh 

Nuu 

arw 

Aic  Arr  Am 

TC(°C) 

35 

36  . 

45 

50 

40 

45 

4S 

80 

45 

25 

60 

60  60  60 

ENVIRO. 

aif 

AUC 

aut 

4\jB 

aua 

auf 

SF 

Mpp 

mfa 

USL 

ml  cl 

TC(°C) 

60 

60 

95 

95 

95 

95 

95 

45 

60 

50 

40 

60  45 

TABLE  3: 

nSD,  DIE  SCREENING  CORRECTION  FACTOR 


Quality  Level 

nSD* 

D 

i.o 

B 

.94 

S 

.85 

*QML  parts  will  initially  be  exposed  to  Class  B  screening  as  a  baseline.  Further  refinement  of  this 
factor  will  be  made  as  data  becomes  available. 


TABLE  4: 

Hq),  DIE  COMPLEXITY  CORRECTION  FACTOR 


Die  Area  (cm^) 

Feature  Size 

.1-.2 

.2-.4 

.4-LO 

1. 0-2.0 

2.0-3.0 

1.00  Micron 

2.1 

4.0 

8.7 

18 

30 

123 

1.6 

2.8 

6.1 

12 

21 

1.50 

1.2 

2.0 

4.1 

8.7 

14 

2.00 

.81 

1.3 

2.5 

4.9 

8.0 

2,50 

.65 

.95 

1.7 

3.3 

5.2 

3.00 

.56 

.76 

1.3 

2.3 

3.7 
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TABLES: 

%,  APPLICATION  ENVIRONMENT  FACTORS 


ENVIRONMENT 

nE 

ENVIRONMENT 

nE 

Gg 

.52 

Am 

6.8 

°MS 

.88 

aIA 

5.4 

Gp 

3.4 

aIF 

8.1 

°M 

5.7 

AUC 

4.0 

Mp 

5.2 

AUT 

5.4 

nSB 

5.4 

aub 

10 

NS 

5.4 

aUA 

8.1 

Nu 

7.7 

auf 

12 

nh 

8.0 

SF 

1.2 

NUU 

8.6 

Mpp 

5.3 

arw 

12 

mfa 

15 

AIC 

3.4 

% 

17 

Arr 

4.0 

CL 

300 

TABLE  6: 

nSP,  PACKAGE  SCREENING  FACTOR 


Quality  Level 

n$p* 

D 

10 

B 

1.0 

*QML  parts  will  initially  be  exposed  to  Gass  B  screening  as  a  baseline.  Further  refinement  of  this 
factor  will  be  made  as  data  becomes  available. 
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TABLE?: 

ripr,  PACKAGE  TYPE  CORRECTION  FACTOR 


Package  Type 
DIP 

Pin  Grid  Array 
Chip  Carrier 


Hermetic 


1.0 

2.2 

4.7 


Nonhermetic 


2.8 

4.0 

6.5 


TABLE  5.1.2.12-8: 

XE0S>  ELECTRICAL  OVERSTRESS  FAILURE  RATE 


V-pH  (ESD  Susceptibility  (Volts))* 

^EOS 

0-  iooo 

.057 

1000-  2000 

.048 

2000  -  4000 

.040 

4009-  16000 

.034 

>16000 

.025 

_ i 

♦Voltage  ranges  which  will  cause  the  part  to  fail.  If  unknown,  use  0-1000  volts. 
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8.0  FAULT  TOLERANT  RELIABILITY  CONSIDERATIONS 


8.1  INTRODUCTION 

VHSIC  and  VHSIC-Like  Chip  designs  often  use  system  style  architecture  to  handle  a  multitude 
of  functions  such  as  CPU,  RAM,  ROM  and  I/O.  To  model  a  system  that  contains  a  large  quantity 
of  VHSIC  and  VHSIC-Like  chips,  with  a  straight  series  reliability  model,  would  be  inaccurate  if 
the  chip  design  employs  such  fault  tolerant  mechanisms  as  redundancy  and  error  detection  and 
correction  (EDAC). 

To  complete  this  VHSIC/VHSIC-Like  reliability  modelling  effort,  a  wide  variety  of  fault 
tolerant  techniques  used  in  chip  designs  were  reviewed  and  general  modelling  techniques  were 
derived  to  account  for  these  redundancy  effects.  An  attempt  was  made  to  account  for  redundancy 
techniques  in  the  model  by  including  a  pi  factor  similar  to  the  other  factor  used.  This  approach  was 
abandoned  however  since  it  was  determined  that  such  a  factor  would  be  an  oversimplification  of 
the  actual  process.  Therefore,  to  mcJ  A  these  effects  accurately,  each  individual  design  must  be 
analyzed  separately  and  modeled  accordingly.  The  following  discussion  summarizes  the 
techniques  that  can  be  used  to  model  these  effects.  These  techniques  are  based  on  tire  following, 
basic  assumptions; 

(1)  The  die  substrate  failure  rate  is  directly  proportional  to  the  die  area,  (i.e.,  25%  of  the  area 
contributes  25%  of  the  failure  rate). 

(2)  All  die  failures  are  independent 

With  these  assumptions  the  modelling  techniques  are  classified  into  two  basic  categories, 
redundancy  and  EDAC.  Section  8.2  contains  tire  redundancy  models  and  Section  8.3  contains  the 
EDAC  model.  Section  8.4  contains  guidelines  for  the  application  of  the  models  to 
VHSIC/VHSIC-Like  designs. 
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8.2  THE  RELIABILITY  OF  REDUNDANT  DESIGNS 


The  reliability  of  VHSIC  devices  can  be  enhanced  by  the  use  of  redundant  circuit  elements  at 
critical  locations  within  the  chip.  Redundancy  involves  the  use  of  two  or  more  signal  paths 
throughout  the  system  by  the  addition  of  parallel  elements.  It  is  noted  that  in  general  the  addition 
of  redundant  circuit  elements  reduces  the  basic  series  reliability  while  at  the  same  time  increasing 
mission  reliability. 

Depending  upon  the  specific  application  there  are  many  approaches  to  redundancy. 
Redundancy  is  classified  into  two  major  classes;  they  arc: 

(1)  Active  Redundancy  -  External  components  are  not  required  to  perform  the  function  of 
detection,  decision  and  switching  when  an  element  or  path  within  the  chip  fails. 

(2)  Standby  Redundancy  -  External  elements  arc  required  to  detect,  make  a  decision  and 
switch  to  another  element  or  path  within  the  device  as  a  replacement  for  a  failed  element 
or  path. 


An  overview  of  the  techniques  modeled  in  this  report  is  given  in  Figure  8-1. 


FIGURE  8-1: 

REDUNDANCY  TECHNIQUES 
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The  following  sections  contain  a  short  synopsis  of  each  of  the  techniques  mentioned  in  Figure 
8-1.  It  is  important  to  note  that  they  use  a  simple  probability,  that  is,  the  reliability  is  modeled  in 
terms  of  the  number  of  successful  paths  through  the  circuit  subtracting  out  the  successes  that  are 
counted  more  than  once.  For  example,  if  there  arc  two  successful  events  that  could  occur  (Path  A 
and  Path  B)  and  they  arc  independent,  then  the  probability  of  successful  operation  is: 

P(AUB)  =  P(A)  +  P(B)  -  P(A)P(B) 


where: 

P(AUB)  »  The  probability  of  either  Event  A  or  Event  B  occurring 
P(  A)  a  The  probability  of  Event  A  occurring 

P(B)  m  The  probability  of  "went  B  occurring 

s.2.1 

Simple  parallel  redundancy  (SPR)  is  one  of  the  most  widely  used  active  redundancy 
techniques.  If  any  functional  dement  fails  open  then  another  identical  path  exists  through  tire 
redundant  elements.  The  concept  is  visualised  its  Figure  8-2. 


Signal 

Output 


FIGURE  8-2: 

SIMPLE  PARALLEL  REDUNDANCY 

All  elements  are  usually  identical,  but  they  may  be  different  The  probability  model  for  this 
case  is: 

R(t)*l-(l-©k)n 
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where: 


R(t)  s  Reliability  at  timet 
X  3  Failure  rate  of  the  corresponding  chip  area 
n  2  Number  of  redundant  elements 

If  the  elements  are  different,  then  the  reliability  of  each  element  must  be  calculated  separately. 
The  advantages  of  SPR  are  simplicity,  a  significant  gain  in  reliability  from  nonredundant  device 
design  and  its  applicability  to  both  analog  ami  digital  circuitry.  The  disadvantages  are  the  issues  of 
load  sharing,  problems  with  voltage  sensitivity  across  redundant  elements  and  electrical  overstress 
propagation. 

8.2.2  D_upleKi!amUd.SMupdMcy 

Duplex  parallel  redundancy  (DPR)  is  used  in  redundant  logic  applications.  It  is  primarily  used 
in  computer  applications  where  redundant  digital  outputs  are  monitored  by  an  error  detection  device 
such  as  a  parity  checker.  If  an  error  is  detected  the  faulty  output  is  disabled  and  the  system 
function  is  never  interrupted.  Figure  8-3  is  an  illustration  of  a  DPR  application. 


Signal 

Input 


FIGURE  8-3: 

DUPLEX  REDUNDANCY 

In  Figure  8-3  Elements  A  and  B  represent  redundant  logic,  SA  and  SB  are  the  switches  to 
disable  the  1  jic  of  A  or  B  if  they  should  be  found  faulty,  ED  is  the  error  detection  element  and  DL 
is  the  diagnostic  logic  element  If  logic  elements  A  and  B  are  identical  die  reliability  model  would 
be  as  follows: 


R(0  »  Pbd  Pol  <2  Rfi  Rs>2  ®  Row  •  R2oui)1 
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where: 


R(t) 

■ 

Reliability  at  timet 

PED 

a 

Probability  of  emor  detection 

PDL 

3E 

Probability  of  the  diagnostic  logic 

re 

s 

Reliability  of  the  redundant  logic  (A,  B) 

% 

m 

Reliability  of  the  identical  switches,  SA  and  SB 

Rout 

m 

Reliability  of  the  output  circuitry 

^out 

m 

PG01  +  PG02  *GA '  PG01  PG02  ^A 

where: 

PG01 

m 

Probability  first  input  to  OR  gate  gets  through 

pG02 

m 

Probability  second  input  to  OR  gate  gets  through 

^GA 

is 

Reliability  of  die  and  gate 

The  advantages  of  employing  the  DPR  techniques  arc; 

(1)  Application  to  duplex*  active  redundant  modes  o*  separate  elements. 

(2)  Will  maintain  system  function  upton-1  failures. 

(3)  Protects  against  both,  open  and  shmt  failure  modes. 

(4)  Faulty  units  can  be  repotted  to  the  next  higher  level  of  assembly  without  disrupting 
operation. 

Hie  disadvantages  of  DPR  are  increased  complexity  due  re  additional  detection  and  sensing 
circuitry,  increased  storage  capability  required  for  redundant  data  elements  and  additional 
diagnostic  routines. 
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8.2.3  BimodalPaxalleLRedundancv 


Bimodel  Parallel  Redundancy  (BPR)  is  an  active  redundancy  technique  that  combines  both 
series  and  parallel  success  paths.  This  technique  prevents  system  level  failures  by  protecting 
against  shorts  and  opens.  Direct  shorts  across  the  chip  due  to  a  single  element  shorting  is  provided 
by  a  redundant  element  in  series.  An  open  across  the  device  is  prevented  by  the  parallel  elements. 
Figure  8-4  and  8-5  contain  the  block  diagrams  for  the  two  major  types  of  BPR. 


Signal 

Input 


FIGURE  8-4: 

BIMODAL  PARALLEUSERIES  REDUNDANCY 


FIGURE  8-5: 

BIMODAL  SERIES^PAILUi^yREDUNDANCY 


The  parallcbserics  techniques  is  useful  when  the  primary  expected  failure  mode  is  open.  The 
series/parallel  technique  is  useful  when  the  primary  expected  failure  mode  is  short  The  reliability 
calculation  for  the  parallel/senes  case  (assuming  identical  functional  blocks)  is: 


R(0»<2RA-RA2)  (2Rb-Rb2) 


where: 


R(t)  s  Reliability  at  time  t 

a  Reliability  of  the  A  Elements 
Rb  a  Reliability  of  the  B  Elements 

The  reliability  calculation  for  the  scries/paraliel  case  (assuming  identical  functional  blocks)  is: 

R(t)  =  2RARB-(RARB)2 


where: 

R(t)  m  Reliability  at  time  t 

R^  a  Reliability  of  the  A  elements 

Rg  ®  Reliability  of  the  3  elements 

The  major  advantage  of  die  BPR  techniques  that  have  been  described  is  that  it  provides  u 
significant  gain  in  reliability  at  the  chip  level  (and  therefore  system  level)  for  short  mission  times. 
Another  advantage  is  that  it  can  provide  greater  protection  against  particular  failure  modes.  Its 
major  disadvantages  are  that  it  is  difficult  to  design  at  the  chip  level  and  for  long  mission  times  it 
can  actually  be  less  reliable  titan  a  non  redundant  design. 

8.14  Mamy^ciiim,Rgilujiilaii£y 

Majority  voting  redundancy  (MVR)  can  be  implemented  in  two  ways;  the  straight  MVR 
technique  and  an  enhanced  adaptive  minority  logic  technique.  The  idea  behind  the  basic  MVR  is 
that  decision  logic  cm  bo  built  into  the  SPR  model  by  inputting  signals  from  the  parallel  elements 
to  a  voting  element  to  compare  each  signal  with  tlie  remaining  signals.  Valid  decisions  are  made 
only  if  the  number  of  useful  elements  exceeds  the  number  of  failed  elements. 


The  adaptive  majority  logic  technique  uses  the  basic  MVR,  but  utilizes  a  comparator  and 
switching  network  to  switch  out  or  iniubit  failed  redundant  elements.  This  enhancement  reduces 
the  possibility  of  a  majority  of  bad  elements  determining  the  vote.  Figure  8-6  and  8-7  contain  the 
block  diagrams  for  the  two  MVR  techniques. 
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FIGURE  8-6: 

MAJORITY  VOTING  REDUNDANCY 


FIGURE  8-?: 

MVR  Willi  ADAPTIVE  ELEMENT'S 


Tbs  MVR  reliability  model  afcmficiag  til  Kdendant  element  axe  identical  is: 


m 


e~*MVRt 


where: 


R(t)  s  Reliability  at  time  t 

»  Failure  rate  of  a  single  redundant  elements 
XMVr  a  Failure  rate  of  the  voting  element 

n  s  The  number  of  redundant  elements  minus  the  minimum  number  of  elements 
required 

The  model  can  be  added  to  so  that  it  can  be  applied  to  the  adaptive  MVR  technique.  The 
reliability  of  the  comparator  and  sensing  circuity  must  be  added  as  well.  Since  there  are  so  many 
possible  implementations  of  this  technique  the  derivation  of  specific  model  is  left  to  the  user. 


The  advantages  of  using  the  MVR  techniques  are  that  it: 

(1)  Can  be  implemented  to  provide  indication  of  faulty  signals. 

(2)  Can  provide  significant  gains  in  reliability  for  slant  missions. 

The  disadvantages  are  that  to  be  effective  it  requires  the  voting  element  to  have  a  much  greater 
reliability  than  the  redundant  elements  and  that  in  some  cases  for  long  mission  times  it  can  produce 
a  lower  reliability. 

8.2.5  Qm.£omz*M.  EsaMaass 


Gate  connector  redundancy  (GCR)  is  a  voting  type  of  redundancy  similar  to  MVR.  It  is 
primarily  used  its  digital  circuitry  where  redundant  elements  require  a  vote,  but  not  as  significant  a 
voting  mechanism  as  those  used  in  MVR.  Outputs  of  the  redundant  elements  are  fed  to  switch-like 
gates  which  perform  the  voting  function. 


The  gates  contain  no  cells  whose  failure  would  caus  redundant  circuit  elements  to  fail. 
Figure  &-S  contains  the  block  diagram  for  the  OCR  technique. 


FIGURE  8-8: 

GATE  CONNECTOR  REDUNDANCY 
The  reliability  model  for  this  technique  is  as  follows: 

R(t)  *  R1&2R3R4  4-  RjRjRjQ*  +  alR2Q3R4  +  +  Qi  R2R3R4 

are: 

R(t)  a  RsbabUity  at  time  t 

Rj  a  lhilial)UUyQfpatlitlaougltAj 

Rl 

Ql  *  1-Rj  (Q  =»  Unreliability) 

R2  ®  Reliability  of  path  through  A2 

Rj»RAik33 


R-3  s  Reliability  of  path  through  A3 

R3  =  R  A  &G 
Q3  =  l-R3 

R4  s  Reliability  of  path  through  A4 

r4  =  rA  rG4 
Q4  =  1*R4 

The  gate  connector  technique  is  usually  only  used  when  the  gates  used  to  provide  the  voting 
function  have  extremely  high  reliabilities.  The  advantages  and  disadvantages  of  GCR  are  the  same 
as  MVR. 


8.2.6  gMfeBsdufiijansx 

Standby  redundancy  techniques  include  implementation  for  both  operation  and  nonoperating 
modes.  Standby  techniques  do  not  employ  any  load  sharing.  As  soon  as  a  faulty  element  is 
detected  then  another  element  is  switched  in  its  place.  Operating  standby  redundancy  (OSR) 
allows  the  redundant  elements  to  remain  active  while  nonoperating  standby  redundancy  (NSR) 
allows  the  redundant  dements  to  remain  inactive.  At  tire  time  of  this  writing  only  OSR  techniques 
are  employed  as  it  is  not  possible  to  activate  or  nower  only  portions  of  a  monolithic  substrate. 
OSR  is  illustrated  in  Figure  8-9. 


Signal 

Input 


Signal 

Output 


FIGURE  8-9: 

OPERATING  STANDBY  REDUNDANCY 

The  ate  the  redundant  elements,  the  Dj  are  the  sensors  which  detect  failures  and  Sj  is  the 
switch.  The  tcliabilW  model  for  OSR  is  as  follows: 


where: 


R(t) «  e 


*A« 


»-1(XA»r  ' 

y 

M  r  J 


R(t)  m  Reliability  at  timet 

Xa  si  Failure  rate  of  the  active  element  and  detector 

n  »  Number  of  active  dements 


The  advantages  of  OSR  are  drat  it  is  applicable  to  both  analog  and  digital  circuitry  and  it  is 
effective  in  protecting  against  failure  due  to  intermittent  failure  modes.  Its  disadvantages  are  the 
increased  delay  time  due  to  sensing  and  switching  functions,  increased  complexity  and  limitations 
on  maximal  reliability  gains  due  to  the  failure  modes  of  the  sensing  and  switching  devices. 
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8.3  ERROR  DETECTION  AND  CORRECTION  RELIABILITY  MODELING 


VHSIC  and  VHSIC-like  devices  sometimes  utilize  error  detection  and  correction  (EDAC) 
circuitry  to  improve  the  reliability  of  Random  Access  Memory  (RAM).  EDAC  uses  error 
correction  codes  (ECCs)  such  as  the  Hamming,  Golay  and  Binary  Coded  Hexidecimal  (BCH)  to 
add  redundancy  to  the  stored  data  elements.  When  the  data  is  written  to  a  memory  location  a  code 
value  is  calculated  and  stored  with  it.  When  the  data  is  read  the  code  is  retrieved  with  it  and  the 
value  of  the  code  is  regenerated.  If  the  two  values  are  not  equal  the  entire  stored  word  is  used  to 
generate  a  new  (and  hopefully)  correct  word.  Though  there  are  many  types  of  ECCs  this  study 
considered  only  single  error  correction/double  error  detection  (SEC/DEC)  codes.  This  type  of 
EDAC  is  implemented  as  shown  in  Figure  8-10. 


FIGURE  8-10: 
EDAC  BLOCK  DIAGRAM 


After  reviewing  a  multitude  of  EDAC  reliability  assessment  models  it  became  apparent  that  as 
reported  in  “Large  Scale  Memory  Error  Detection  and  Correction  Study,"  (Reference  79),  the 
models  can  become  quite  complex.  As  reported  in  tiuu  study  the  Levine  and  Meyers  EDAC  model 
is  an  excellent  approximation  that  is  relatively  easy  to  use.  The  model  is  presented  as  follows  with 
one  natural  extension  for  chip  level  application. 
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RECcW  “  (RRAMm  +  mRRAMm  l  (*'RRAM^REDAC 


RECc(0  55  Reliability  of  RAM  at  time  t 
%AM  53  Reliability  of  the  RAM  as  a  function  of  the  substrate  failure  rate: 

RRAM  =  exP  ^'^-RAM *) 

redac  -  Reliability  of  the  additional  substrate  used  to  implement  ED  AC: 

^EDAC  *  exP  (*^EDAC  l) 

m  m  Total  number  of  d=tta  bits  m  =  n+k,  n  »  data,  k  =  code 

The  failure  rates  for  the  RAM  and  EDAC  functions  are  determined  by  the  percentage  of 
substrate  area  they  occupy.  If  the  RAM  takes  up  50%  of  the  substrate  then  it  experiences  ,50%  of 
the  substrate  failure  rate. 

For  multiple  ECGs  there  ate  models  that  can  \m  applied  for  which  the  reader  is  referenced  to 
RADC-TR'87-92  for  mom  detail 
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8.4  SUBSTRATE  LEVEL  RELIABILITY  MODELING  GUIDELINES 

The  reliability  models  described  in  Sections  8.2  and  8.3  should  be  applied  with  guidelines  put 
forth  in  this  section.  The  system  level  techniques  that  are  applied  below  the  chip  level  have  a  major 
difference.  Instead  of  summing  piece-part  failure  rates,  a  percentage  of  the  substrate  failure  rate  is 
used.  That  percentage  is  equal  to  the  percentage  of  substrate  real-estate  that  each  respective 
function  occupies.  Figure  8-11  illustrates  this  concept 


SUBSTRATE  REAL  ESTATE  PERCENTAGES 

Gives.  that  the  reliability  of  the  RAM  has  been  improved  by  the  EDAC  circuitry  and  that  they 
topther  take  50%  of  the  substrate  area,  the  reliability  of  the  RAM  would  be  calculated  using  50% 
of  tire  predicted  substrate  failure  rate . 

&  summary,,  the  reliability  modeling  of  VHSIC  and  VHSIC-like  device  follows  die  following 
sequence: 

<1 )  Obtain  all  the  required  information  on  the  device,  its  design  and  its  application. 


(2)  Calculate  the  failure  rare  of  the  substrate  and  its  packaging. 
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(3)  Determine  if  the  architecture  of  the  device  includes  any  redundancy  or  EDAC 

(4)  If  it  does  not,  add  the  die,  package,  and  EOS  failure  rates  and  the  calculation  is  complete. 
If  it  does,  proceed  to  step  (5). 

(5)  Select  the  appropriate  model  for  the  type  of  redundancy  employed. 

(6)  Calculate  the  percent  of  substrate  that  the  redundancy  implementation  consumes. 

(7)  Calculate  the  reliability  of  the  redundant  items  and  the  nonredundant  items. 

(8)  Calculate  the  chip  level  reliability  as  follows: 

P*(t)  s*  Rj^m  RpKG 

where: 

R(t)  &  Reliability  at  time  t 
Rjy  se  Reliability  of  the  redundant  Items 
Rnri  a  Reliability  of  the  nonredundant  items 
Rpkq  m  Reliability  of  the  package 

The  chip  level  reliability  is  viewed  as  illustrated  in  Figure  8-11 


FIGURE  8-12: 

CHIP  LEVEL  RELIABILITY  BLOCK  DIAGRAM 

If  there  is  no  redundancy  employed  1009b  of  the  substrate  failure  rate  is  used  to  calculate  the 
chip  level  reliability. 
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9.0  MODEL  VALIDATION 


All  data  collected  in  this  (continued  in  Appendix  D)effort  was  used  in  derivation  of  the  model 
parameters.  Although  the  majority  of  this  data  was  not  from  1.25  micron  VHSIC  technology,  but 
rather  2.0  micron  average,  it  was  necessary  to  use  it  all  to  insure  statistically  sound  parameters. 


The  methodology  used  therefore  was  to  validate  and  refine  the  model  based  on  the  1.0  and 
1.25  micron  data  that  was  available.  Table  9-1  summarizes  the  results  of  this  validation  effort.  All 
data  available  for  these  devices  was  accelerated  life  test  data  taken  at  125°C,  150°C  or  200°C 
ambient  temperatures. 

Table  9-1  lists  tire  number  of  actual  part  hours,  the  number  of  devices  tested,  the  number  of 
failures,  the  observed  fai’nre  rate,  the  duration  of  the  test,  the  predicted  average  (over  the  test 
duration)  failure  rate  using  the  detailed  model  and  the  predicted/observed  failure  rate  ratio.  The 
predicted  values  were  obtained  by  using  a  ground  benign  environment  with  a  duty  cycle  of  1.00. 


The  mean  and  standard  deviation  of  die  log  ~~  values  yields  -.056  and  .43,  respectively. 


Since  the  mean  of  the  logged  values  is  -.056,  the  actual  mean  of 


is  .88,  indicating  that  the 


model  on  the  average  is  12%  optimistic.  However,  the  zero  failure  data  was  not  used  in  this 
analysis,  which  would  tend  to  cause  somewltat  pessimistic  observed  failure  rate  values.  The  zero 
failure  data  accounted  for  11. 5  percent  of  the  total  observed  part  hours  and  therefore  the  observed 
failure  rates  should  be  approximately  11.5%  lower  than  the  values  used  in  this  analysis  since  only 
data  with  observed  failures  were  used.  It  should  also  be  noted  here  that  a  12  percent  deviation  is 
very  small  compared  to  tire  natural  failure  rate  variability. 
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TABLE  9-1: 

MODEL  VALIDATION  DATA 


Part  Hours 

#  Tested 

#  Failed 

X,  Observed 

F/lCtf  hrs. 

Test 

Duration  hrs. 

Xp 

Xp 

Xo 

24,900 

155 

1 

40 

168 

50.4 

1.26 

18,000 

113 

1 

55 

168 

48.4  " 

.88 

17,000 

107 

2 

117 

168 

43.9 

.37 

15,600 

97 

0 

0-64 

168 

59.4 

.37 

21,100 

132 

0 

047 

168 

37.9 

.37 

22,600 

141 

3 

133 

168 

27.8 

.21 

20,100 

126 

1 

50 

168 

10.0 

.20 

34,600 

216 

0 

0-29 

168 

24.5 

.20 

18,000 

112 

1 

56 

168 

11.0 

.20 

27,500 

172 

i 

36 

168 

20.5 

.57 

32,500 

203 

0 

31 

168 

21.0 

.68 

78,600 

491 

3 

38 

168 

21.0 

.55 

25,100 

157 

0 

040 

168 

13.6 

,5.1 

40,900 

255 

1 

24 

168 

23.0 

.96 

54,000 

54 

4 

74 

1,000 

81.9 

1.11 

2,660,000 

54 

11 

4 

168 

-  i  li 

19.8 

4.95 
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TABLE  9-1: 

MODEL  VALIDATION  DATA  (CONTD) 


Part  Hours 

#  Tested 

#  Failed 

X  Observed 

F/106  hrs. 

Test 

Duration  hrs. 

Xp 

Xp 

Xo 

39,312 

234 

3 

76 

168 

37.4 

.49 

710,000 

510 

20 

28 

30.4 

1.09 

218,000 

218 

4 

18 

1,000 

30.4 

1.69 

497,000 

497 

3 

6 

1,000 

30.4 

5.07 

29,064 

173 

1 

34 

168 

37.4 

1.1 

191,000 

191 

2 

10 

1,000 

30.4 

3.04 

706,000 

706 

7 

10 

1,000 

30.4 

3.04 

^.712 

284 

0 

0-21 

168 

37.4 

3.04 

349,000 

698 

0 

0-3 

500 

34.4 

3,04 

48,048 

286 

1 

21 

168 

37.4 

1.78 

49,500 

99 

0 

0-20 

500 

34.4 

i.78 

46  240 

80 

2 

43 

580 

9.1 

.21 

182,090 

315 

0 

0-5 

580 

9.1 

.21 

3,555 

42 

0 

0-281 

84 

91.9 

.21 

1,232 

IS 

0 

0-S12 

68 

98.4 

,21 

2,076 

34 

0 

0482 

61 

9S.4 

.21 

The  conclusion  of  this  exercise  is  that  the  detailed  model  accurately  predicts  the  failure  rate  as  it 
stands  and  does  not  need  to  be  modified. 

There  appears  to  be  a  relatively  large  variance  in  observed  reliabilities  between  manufacturers 
but  relatively  little  for  a  particular  manufacturer.  For  example.  Figures  9-1  and  Figure  9-2  from 
(Reference  14)  illustrate  the  variability  that  can  be  expected  from  a  well  controlled  product  line  in 
which  a  variance  of  a  factor  of  approximately  two  is  observed.  The  data  from  the  VHSIC  database 
indicates  that  an  order  of  magnitude  variation  is  not  untypical,  especially  between  manufacturers. 
Several  factors  can  account  for  this,  particularly  the  fact  that  the  actual  defect  densities  were  not 
known  for  this  data,  and  probably  varied  significantly  between  manufacturers.  This  further 
illustrates  uie  significance  of  defect  density  as  ah  indicator  of  reliability. 

Another  exercise  was  undertaken  in  which  it  was  assumed  that  the  observed  failure 
differences  between  manufacturers  was  attributable  to  differences  in  defect  density.  In  this 
analysis  a  defect  density  was  calculated  for  each  failure  rate  which  made  the  observed  equal  the 
predicted  failure  rate.  A  mean  of  these  defect  densities  were  then  calculated  for  each  manufacturer 
and  ranged  from  .3  to  2.57  defects  /c m2.  The  failure  rate  prediction  was  then  performed  again 
with  this  "customized"  defect  density,  and  the  log  of  the  prcdictcd/observed  ratio  was  calculated. 
The  standard  deviation  of  these  values  was  in  this  case  .27  as  opposed  to  .43  as  in  the  case  of  the 
"uncustomized"  defect  densities. 


A  summary  of  the  standard  deviations  of  live  (log-^* )  values  for  a  typical  regression  model 

Xq 

(Reference  78),  the  detailed  model  with  Dal,  and  the  detailed  model  with  customized  defect 
density  are  given  in  Table  9-2. 


FIGURE  3-1: 

MOTOROLA  MtCRGmOCESStM  PRODUCT GROUP  HIG&  TEMFSRATURE 
OPERATING  LIFE  1984-1 985  -TOTALS 


CYCLE  1984  -  1935  TOTALS 


TABLE  9-2: 

STANDARD  DEVIATION  SUMMARY 


Model  Type 

0 

Regression  Model 

.47 

Detailed  Model  (D=l) 

.43 

Detailed  Model  (Custom  D) 

.27 

An  analysis  like  this  was  not  performed  on  the  short  form  model  since  it  was  intended  only  to 
be  used  for  field  failure  rate  predictions  and  extrapolating  it  to  high  temperature  operating  life 
conditions  is  questionable.  It  is  believed  however,  that  its  precision  should  be  comparable  to  a 
typical  regression  model,  since  these  two  model  types  have  similar  forms  and  complexities. 
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10.0  SAMPLE  CALCULATIONS 


This  section  of  the  report  presents  various  sample  calculations  using  the  detailed  model  with 
various  input  variables.  The  failure  rate  graphs  in  this  section  were  generated  by  a  computer 
program  that  was  written  to  perform  predictions  using  the  detailed  model.  The  first  illustration  in 
each  figure  is  the  input  screen  in  which  the  user  specifies  the  input  variables  to  be  used.  A 
description  of  each  of  these  fields  taken  from  the  users  manual  to  this  program  is  given  i 
Appendix  C.  The  second  illustrates  the  calculated  failure  rate  as  a  function  of  time  and  plots  these 
values.  The  dashed  lines  represent  (from  top  to  bottom)  the  highest  predicted  value,  mean 
predicted  value  and  the  lowest  predicted  value,  all  for  a  20  year  period.  The  first  twelve  years  of 
operation  only  are  plotted. 

The  input  variables  that  were  varied  for  these  predictions  were  Die  Area,  Feature  Size, 
Temperature,  Duty  Cycle,  and  Screening  Duration.  Table  10-1  summarizes  the  values  of  each  of 
these  variables  for  each  example. 

It  can  be  seen  in  most  of  these  failure  rate  plots  that  the  failure  rate  is  almost  always 
continuously  decreasing  throughout  the  useful  life  of  the  device,  and  that  it  takes  a  very  long  time 
to  reach  the  constant  failure  rate  portion  of  the  curve.  Tnis  is  specially  true  in  benign,  low 
temperature  applications  where  many  defects  are  not  accelerated  to  failure  in  the  early  life  of  the 
device.  Conversely,  the  model  indicates  in  high  temperature  applications,  that  the  failure  rate  is 
initially  very  high  but  decreases  rapidly  duo  to  the  fact  that  defects  are  being  removed  at  a  high  rate. 

It  is  also  apparent  in  this  model  that  die  area  and  feature  size  heavily  influence  the  predicted 
failure  rate,  as  well  as  defect  density. 


TABLE  10-1: 

EXAMPLE  PREDICTIONS  INPUT  VARIABLES 


Duty 

Cycle 

Ambient 

Temperature 

(°C) 

Screening 

Duration 

(hrs.) 

.50 

50 

168 

.50 

50 

168 

.50 

50 

168 

1.00 

25 

168 

.50 

25 

168 

1.00 

25 

168 

1.00 

75 

168 

1.00 

125 

168 

1.00 

25 

168 

1.00 

75 

168 

1.00 

125 

168 

1.00 

25 

168 

1.00 

25 

168 

1.00 

25 

168 

1.00 

25 

168 

1.00 

25 

168 

1.00 

25 

500 

1.00 

25 

1000 
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EXAMPLE  1 


UHSIC/VHSIC  Like  Model 


Package 

Ki 


Pins 

m 


a  Unit 
a 


Dev  Type 

U 


Die  Area  Theta  Ja 

BUG  t£I23 


Feat  Size 

MKH 


Defect  Den 


ESD 

urn 


Met  Type 

BJ 


Scm  Duration 

k  mmm 


Tew  Pur 


Tew 

M 


Act  Pur 


Curr  Den  _ 


ElctFld  Env 

U 


RHt*d 

awa 


SuLst  I 

mm 


Drain  I 


r  Oxide  r  Metal 


Sf" 

r  Hot  Carr 


Instant  Tine 


Instant  Laxbdi 


Factor 

Value 

Oxide 

0.010785 

TDDB 

1.3218405347E-15 

Metal 

0.001522 

EM 

0.024540 

Hot  C 

3.5837777503E-24 

Cont 

0.000260 

Pack 

0.113872 

ESD 

0.053286 

Misc 

0.012643 

Tine 

0.262800 

LaJtbda 

0.216907  | 

1  Average  failure  rate:  1 

0.274513 

|  Any  key  to  v leu  chart. 
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EXAMPLE  2 


UHS1C/VHSIC  Like  Model 


Part  Number _  Dose  Mfr  Package  Pins 

i!  ti  ti  m 

Complexity  I  Unit  Dew  Type  Die  Am  Theta  Ja 

wamm  a  «  ckb  kies 

Feat  Size  Defect  Den  Nfg  Pro  ESD  Het  Type 


Complexi 


mm q  a 

ion  Temp 


■Pffl  ti 


Scrn  Duration  Temp 

h  mamm  m 

Tmp  Act  Pur  Curr  Den 


Elct  Fid  Env  RHmd 

mm  i]  aia 


SuLst  1  Drain  I  Dty  Cyc 

aaaaaa  mms  a& 

r  Oxide  r  Hetal  r  Hot  Carr 


Instant  Time 
Instant  Lambda 


Factor 


Time 

Laml>da 


0.253924 

6.6092226733E-15 

2.227610 

2.224540 

3.5837777503E-24 

0.202262 

0.113872 

0,253286 

0.012643 

2.262820 

2.266133 


Awerage  failure  rate: 
0.567195 


Any  key  to  view  chart. 


Instantaneous  Failure  Rate 


1.158272 
8.965227 . 

8.772182 • 
8.579116. 

8.386891- 

8.193845 


"■fSJ*  '  i  1  i  •  i  1  i  1  i  1  i  1  i  1  i  1  i  1  i  1  i  •  i  1  i 

in  Vears  8  1  2  3  4  5  6  7  8  9  18  11  12 

Average  Lambda  Obser.  -  Lower  Upper  Tears  to  Fail 

8.567195  8.266133  8.965227  38 

Press  'E'  to  exit  or  any  other  key  to  view  the  next  set  of  data  points. 


EXAMPLE  3 


VHSIC/VHSIC  Like  Model 


Part  Nunier 

amaa 


Package 

ti 


Pins 


Conplexity 

■BiSEs 


I  Unit 

ti 


Dev  Iype 

ti 


Die  Area  Theta  Ja 
HB3  ram 


Peat  Size 

ili*« 


Defect  Den 

mitn 


Mfg  Pro 
il 


ESD 

tea 


Met  Type 

ti 


Scrn  Duration 

k  mamm 


Ten?  Pur 


Act  Pur 


Ten? 
i£h 

Curr  Den 


ElctFld  Env 

tl 


Hind 

5*H 


Suhst  I 

mmz 


Drain  I 
tUSMM 


r  Oxide  r  Metal 


Sb* 

r  Hot  Carr 


Instant  Tine 


Instant  knkda 


Factor 

Value 

Oxide 

0.107847 

TDDB 

1.3218405347E-14 

Metal 

0.915220 

EM 

0.024540 

Hot  C 

3. 5837777503E-24 

Cont 

0.000260 

Pack 

0.113872 

ESD 

0.053286 

Nisc 

0.012643 

Tine 

0.262800 

Lanhda 

0.327667 

Average  failure  rate: 
Q. 933048 


Any  key  to  view  chart,  j 


EXAMPLE  4 


Part  Nuxher 

amasi  _ 


UHSIC/UHSIC  Like  Model 


Complexity  ft  Unit 
■■LEERS  U 


Dew  Type 

ti 


Pins 

m 

Die  ftpea  Iheti  Ja 
CBS  KITH 


Feat  Size 

uke 


Defect  Den 


Mf?  P M 
<1 


ESD 

vm 


Net  Type 

Bi 


Scm  Duration 
Bi  MHtWmi 


Teep 

m 


Pur 


ii 


Act  Pur 


Curr  Den 


EIctFld  Env 

|] 


RHnd 

JSS 


Suhst  I 

mi m 


Drain  I 

aaaaBSB 


r  Oxide  r  Metal 


Stf" 

r  Hot  Carr 


Instant  Title 


t, 


Instant  Laxfcda 


Factor  Value  | 

Oxide  0.818971 

TDDB  1,944141 0446E-14  1 
Metal  0.004718  * 

EN  1.88512S9788E-18 
Hot  C  8. 4182293786E-21 
Cent  0.080133 

Pack  0.113872 

ESD  0.0S3286 

Misc  0.006201 

Tine  0.262800 

Laxfcda  0.197181 


Average  failure  rate: 
0.431460 

Any  key  to  vieu  chart. 
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EXAMPLE  5 


=1  VKSIC/UHSIC  Like  Model 


Part  Nunier 

'»r 

Cowlgxita 

■MUSSES 

Feat  Size 

ima 


Package  Pins 

Ki  m 

4  Unit  Dev  Type  Die  Am  Theta  Ja 

it  Bj  taa  tuts 


Defect  Den 


Mfg  Pm 
<1 


tern  Duration 

h  tamm 


Tear  Pur 


Act  Pur  Curr  Den 


ESD  Met  Type 
I ffiS  Wi 


mud 

5®3 


Elet  Fid  Env 

mnwm  u 


SuJbst  I  Drain  I 

arcana 

r  Oxide  r  Metal 


Stf" 

r  Hot  Carr 


Instant  Tine 
■mpjjjjju 

Instant  Lanhda 


Factor 

Value 

Oxide 

0,053956 

TDDB 

1.1888458255E-16 

Metal 

0.003584 

EM 

8. 2267213373E-23 
3, 1705479073E-23 

Hot  C 

Cont 

q  oaocK’j 

Pack 

0.113872 

ESD 

0.053286 

Misc 

0.011529 

Tine 

0.262800 

Lanbda 

0.236294 

Average  failure  rate: 
0.371757 

Any  key  to  view  chart. 


EXAMPLE  6 


VHS1C/VHSIC  Like  Model 


Package 

ti 


Pins 


Conplexitu  4  Unit 
wM  U 


Dev  Type  Die  Area  Theta  Ja 

ti  tas  ass 


Feat  Size 

USES 

Scrn 

ti 


Defect  Den 
iirnttw 


<*.  <'<*<* lbo 


Duration 

mtm 

Tnp  Act  Pur 


Tew 

M 

Curr  Den 


Mfg  Pro 
3 

Pur 


<*.  0 


EJD  Net  Type 
IBS  ti 


Bind 

SB 


ElctFld  Env 

- u 


Suhst  I 

-mm 


Drain  1 


Dty  Cyo 


r  Oxide  r  Metal  r  Hot  Carr 


Instant  Tine 


Instant  Laxhda 


Factor 

Value  “| 

Oxide 

0,818971 

TDDB 

1.94414104481-14  j 

Metal 

8.004718  f 

» 

1.0851259788E-18 

Hot  C 

8.4182293786E-21 

Cont 

0.000133 

Pack 

0.113872 

ESD 

0.853286 

Misc 

0.006201 

Tine 

0.262800 

Lanhda 

0.197181 

1  Average  failure  rate: 

0.431460  1 

1  Any  key  to  view  chart. 
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EXAMPLE  7 


Part  Hunter 

wr.mn  ~ 

Conplexity  i  Unit 

ll 


VHSIC/VHSIC  Like  Model  \ - 

Package  Pins 

1}  W 


Dev  Type 

Wi 


Die  Area 

tsa 


Feat  Size 

MBS 


Defect  Den 


Mfg  Pro 
<1 


ESD 

IKS 


Theta  Ja 

Net  Type 

K 


Scm  Duration 


<■.  <Y«Ub 


Tenp  Pwr 


£ 


Act  Pur 


11  !! 

Curr  Den 


Elct  Fid  Env 

fi 


fflnd 

aa 


Subs t  I  Drain  I 

mamm  mam 


r  Oxide  r  Metal 


r  Hot  Carr 


Instant  Tine 


Instant  Lanhda 


Factor 

Value  | 

Oxide 

2. 8230992648E-87 

TDDi 

1.7111589750EH39 

Metal 

1, 6091920696E-O7 

EM 

0,004214 

Hot  C 

1.4914253023E-24 

Cont 

0.010565 

Pack 

0.113872 

ESD 

0.053286 

Misc 

Vi 

Tine 

0.262800 

Lanhda 

0.181942 

Average  failure  rate: 
9. 528288 

Any  key  to  vieu  chart. 


Instantaneous  Failure  Kate 


Average  LanMa  OLser.  -  Loner  Upper  Years  to  Fail 

0.528288  0.188896  3.403151  38 

Press  'E'  to  exit  or  any  other  key  to  vieu  the  next  set  of  data  points. 
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EXAMPLE  8 
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EXAMPLE  9 


Put  Hunter 

III '  "i  — 


UHSICAIHSIC  Like  liodei 


D«sc  Mfr  Package 

U  U  Kl 


FiCtOP 


Conplexity  1  Unit  Dev  Type  Dip  Am  Theta  Ja 

tMimn  u  ti  KSfi  ram 

Feat  Size  Defect  Den  Hfg  Pro  ESD  Net  Type 

utm  mam  a  urn  a 

Scpn  Duration  Tens  Pup 

CJ  ■HRWHU  m  ■EBB 

Tnj>  Act  Pup  Cupp  Den  Elct  Fid  Env  RHnd 

_  wmass  u  s  *a 


Tine 

Lantda 


a  Q47a?e 

4.8603526115E-14 

3.911795 

1.0851259788E-18 

8, 4182293786E-21 
3.003133 
0.113872 
0.353286 
0.006201 

0.262800 
0. 232714 


Subsj  I  ,  Drain  I  Dty  Cyc 

'WESHm  immn  y fa 

r  Oxide  r  Hetal  r  Hot  Carp 


Instant  Tine 
Instant  Lambda 


Average  failure  rate: 
0. 805806 


Any  key  to  vieu  chart, 


Instantaneous  Failure  Rate 


2.109648 

1.750533 

1.400426 

1.059320 

0.70O213 

0.350107 


eQQQAAA 

e  vown 

Tine 

in  Years 


1  2 


5  6 


8  9  10  11  12 


Avepage  Lanbda  Obser,  -  Loner  Upper  Years  to  Fail 

0.805806  0.232714  1.750533  30 

Press  'E*  to  exit  or  any  other  key  to  viee  the  next  set  of  data  points. 
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EXAMPLE  10 


UHSIC/VHSIC  Like  Modi. 


Part  Hunker 
'MM 


Package 

Kl 


Feat  Size 

MBS 


Defect  Den 
tmiwa 


Sent  Duration 

«  ■BWMWI 


is 


Act  Pur 


Suhst  I 

naan* 


r  Oxide  r  Metal  r  Hot  Carr 


T 


Pins 

933 


Theta  Ja 

worn 

Met  Type 

W 


Instant  Tine 


Instant  Lanhda 


Factor  Ualue 


Oxide 
TDDB 
Metal 
EM 
Hot  C 
Cont 
Park 
ESU 
Misc 

Tine 

Lanhda 


7.0577481621EH37 
4.2778974375EH39 
4. 0229801739E-07 
0.004214 
1.4914253023E-24 
0.010565 
0.113872 
0.053286 

0QQVY34 
» 

0.262800 

0.181943 


Average  failure  rate! 
1.002181 

Any  key  to  vieu  chart. 


EXAMPLE  11 


Part  Nuaber 

fflMT 

Complexity 

WKtitmn 


VHSIC/VIIS1C  Like  Model  |= 

Package 

ftl 


I  Unit 
U 


Dev  Type 

B 


Pins 

m 

Die  Area  Theta  Ja 
M«  MJtn 


Peat  Size  Defect  Den  Kfy  Pro  CSD 
HKS  aasHWl  3  4G33 


Met  Type 

B 


Scm  Duramen 

B 


T»p  Act  Pur 


Tenp  Pur 


leap 

m 

Curr  Den 


ElctFld  Env 

U 


RHad 

SStB 


Sllbs  t  I 

attaas 


Drain  I 

aaaapa 


Cyc 


r  Oxide  r  Metal 


ptyc 

ItfW 

r  Hot  Carr 


Instant  Tiae 


Instant  Laabda 


20 


Factor 

Value 

Oxide 

22. 755301 

TDDB 

8. 4361450493E-18 

Metal 

END  OF  LIFE 

EM 

END  OF  LIFE 

Hot  C 

o  QCCOGQ 

Cent 

1. 046G60 

Pack 

0.113872 

ESD 

0. 053288 

Misc 

0,927430 

Tine 

0.000001 

Laabda 

END  OF  LIFE 

TOO  HU  POINTS  TO  PLOT 


Any  key  to  continue. 


EXAMPLE  12 


i - - -—-I  UHSIC/VHSIC  Like  Model  E— = 

Part  Nunher _  Desc  Mfr  Package  Pins 

ft!  U  K1  m 

alexity  i  Unit  Dev  Type  Die  Area  Theta  Ja 

fS2£  ll  ti  MS  ««« 

Feat  Size  Defect  Den  Mfg  Pro  ESD  Met  Type 

■IBS  aa$JtB  <J  ftEE  K 


Factor 


0, 094853 


Scrn  Duration  leap  Pwr 
El  M  WKXTH 

Tnp  Oct  Pur  Curr  Den  ElctFld  &v  RHnd 

m  mass  mtxmtti  mwm  u  as 

Sukst  I  Drain  I  DtyCyc  < - 

aauaaa  ames  aa  1  instant  tine 

c  Oxide  r  Metal  r  Hot  Carr 


IDDB  9.7207052230E-14 
etal  0.023591 

EM  1.0851259788E-18 
jtC  8. 4182293786E-21 
on  t  0,000133 

ick  0.113872 

SD  0.053236 

isc  0.006201 


Tine 

Lankda 


0.262800 

0.291936 


Instant  Tine 
Instant  Lanhda 


Average  failure  rate! 
1.429717 


Any  key  to  view  chart. 


Instantaneous  Failure  Rate 


3.974809 

3.312341 

2.649873 

1.987485 

1.324936 

0.662468 


0  1  2  3  4  5  6  7  8  9  10  11  12 

Average  Lanhda  Ohser.  -  Loner  Upper  Vears  to  Fail 

1.429717  0.291936  3.312541  30 

Press  'E'  to  exit  or  any  other  key  to  vie*  the  next  set  of  data  points. 
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EXAMPLE  13 


VHSIC/VHSIC  Like  Model  |= 


Fart  Nuxter 

fliaiai 


Desc  Hfp  Package  Fins 

u  ti  kj  >m 


Complexity  i  Unit  Dev  Type  Die  Area  Theta  Ja 
■BUSSES  If 


Wi 


US  WiiRti 


Feat  Size  Defect  Den  Mfg  Fro 

mpm  aam  sj 


ESD  Met  Type 

IBS  U 


Scm  Duration 

ti 


Duration  Tenp  Pur 

mmum  m  ■ekh 


Act  Pur  Curr  Den  Elct  Fid  Env  Wind 

■BM«  ■aagasca  u  as 


Suts t  I  Drain  I 

tttatttm  mam 

r  Oxide  r  Metal 


5&c>t 

r  Hot  Carr 


Instant  Tine 
Instant  Lambda 


<!*i  tffc'tCYtfV 


j  Factor 

Value  1 

Oxide 

0.023713  I 

TDD9 

2.4301763057E-14 

Metal 

0.005898 

EM 

1. 0851259788E-18 

Hot  C 

8.4182293786E-21 

Cont 

0.000133 

Pack 

0.113872 

ESD 

0.053286 

Misc 

0.006201 

Tine 

0.262800 

Lambda 

0.203103 

Average  failure  rate: 
0. 493851 

Any  key  to  vieu  chart. 


Instantaneous  Failure  Hate 


Average  Lanbda  Obser.  -  Lower  Upper  Years  to  Fail 

0.493851  0.203103  0.969829  30 

Press  'E'  to  exit  or  any  other  key  to  view  the  next  set  of  data  points. 
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EXAMPLE  14 


MHS1C/VHSIC  Like  Model 


Package 

Ki 


Pins 


Conplexity 

■MtE&SS 

Feat  Size 

IflHS 


i  Unit  Dev  Type  Die  Area  Theta  Ja 

[]  ti  UE3  Wiiwa 


Defect  Den 


Mfg  Pro 
<3 


ESD  Net  Type 
USB  Ki 


Scrn  Duration 

K)  ■BEsaia 


Tew 
Curr  Den 


& 


Act  Pvr 


Pvr 

■E2KS 


Elct  Fid  Env 

■asm  ii 


RHnd 

tftiB 


Subst  I 


Drain  I 

imam 


r  Oxide  t  Metal 


&&Cst 

%  Hot  Carr 


Instant  Tine 


Instant  Lanbda 


Factor 

Value 

Oxide 

0.010539 

TDDB 

1.0800783581E-14 

Metal 

0.002621 

EM 

1.0851259788E-18 

Hot  C 

8. 4182293786E-21 

Cont 

0,000133 

Pack 

0.113872 

ESD 

0.053286 

MiSc 

0.006201 

Tine 

0,262800 

Lanbda 

0,186653 

Average  failure  rate: 
0. 320542 

Any  key  to  vieu  chart. 


EXAMPLE  15 


VHSIC/UHSIC  Like  Model 


Part  Number _  Desc  Mfr  Package  Pins 

U  ti  tl  m 

Complexity  i  Unit  Dev  Type  Die  Area  Theta  Ja 

■■iww  ij  n  tang  hmm 

Feat  Size  Defect  Den  Mfg  Ppo  ESD  Met  Type 

Him  atom  <j  11m  k 

Scrn  Duration  Temp  Pur 

n  ■mama  m  mtes 

Tup  Act  Pur  Curr  Den  . .  Elct._F.ld  Env  RHmd 

m  mum  ara-twwa  wmam  u  stts 

Subst  I  Drain  1  Dty  Cyc  - 

mX3m  mum  !!E3  Instant  Time 


Factor 


Time 

Lambda 


0.005270 

5.4003917905E-15 

0.001311 

1.0851259788E-18 
8. 4182293786E-21 
0.000133 
0.113872 
0.053286 
0.006201 

0.262800 

0.180073 


EXAMPLE  16 


VHSIC/VHSIC  Like  Model 


Part  Hunter 

1M 


Package 

tl 


Pins 


^^Mitg  4  Uni i  Dev  Type 


U 


Die  Area  Theta  Ja 
5*b  itaag 


Feat  Size  Defect  Den  Mfg  Pro 
MBS  2J283  <J 

Sera  Duration  Tenp  Pup 


ESD  Met  Type 

IQS  fcj 


Duration  lew  Pup 

wi  mmm  m  wms 

Imp  Act  Pup  Cupp  Den 

ib  mffl  ■atm 

Subst  I  Drain  I  Dty  Cyc 

imam  UE3 

r  Oxide  r  Metal  r  Hot  Capp 


Elct  Fid  Env 

II 


RHnd 

2*fl 


Instant  Tine 


Instant  Lanhda 


«*.  KY«V« 


Factor 

Value 

Oxide 

8.882189 

TDDB 

2.1601567162E-15 

Metal 

8.888524 

EM 

1.8851259788E-18 

Hot  C 

8.4182293786E-21 

Cont 

8.888133 

Pack 

8.113872 

ESD 

8.853286 

Misc 

8.886281 

Tine 

8.262888 

Laxhda 

8.176124 

Average  failure  rate: 
8.289625 


Any  key  to  vieu  chart. 


EXAMPLE  17 


=4  VHSIC/VHSIC  Like  Model 


Part  Hunter 

amiaair- 

CoMplexita  1  Unit 
■■EE&S  J 


Package 

Ki 


Pins 


Dev  Type 


Die  Area  Theta  Ja 

Etfi  M.-JW3 


Feat  Size 


Defect  Den 


Mfg  Pro 
<1 


ESD  Het  Type 

i m  a 


Scm  Duration 

a  mmt¥m 


Tenp  Pur 


Act  Pur 


Tenp 

!Hjf 

Curr  Den 


Elct  Fid  Env  PHnd 

i!  ssa 


Suist  I 


Drain  1 

mxs& 


r  Oxide  r  Metal 


5&CM 

r  Hot  Carr 


Instant  Tine 
Instant  Lantda 


Factor 

Value 

Oxide 

0. 061975 

TDD3 

2. 7015794139E-15 

Metal 

0.000452 

EM 

2.5286245249E-16 

Hot  C 

8. 4726084007E-21 

Cent 

0.  000133 

Pack 

0.113872 

ESD 

0.053286 

Mise 

0.006192 

Tine 

0.262800 

Lanbda 

0.175910 

Average  failure  rate: 

0.207784 

1  Any  key  to  vieu  chart,  j 
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EXAMPLE  18 


Part  Nuxher 


VHSIC/VHSIC  Like  Model 


ti  i] 


Complexity  4  Unit  Dev  Tape  Die  Are 

■Bmrn  u  ti  bib 

Feat  Size  Defect  Den  Mfg  Pro  ESD 

-mmi  <j  us 


Scrn  Duration 

ti  BEE3BB 


Tap  Ac 

t£  ■ 

Suist  I 


Act  Pwr 


leu? 
SB  m 

Curr  Den 


1- . 

tge  Pins 

Factor 

Value  | 

m 

Oxide 

0.001790 

TDDB 

3.7296118073E-15 

Theta  Ja 

Metal 

0, 000362 

WitJtfi 

EM 

5.4890132761E-14 

Hot  C 

8.3550729288E-21 

Met  Type 

Cont 

0.000133 

ti 

Pack 

0,113872 

ESD 

0.053286 

Misc 

0.006177 

Tine 

0.262800 

Env  RHmd 

■i  >a>iri 

Laxhda 

0.175620 

Drain  I  Dty  Cyc 


Average  failure  rate: 
0. 205243 


11.0  MODIFICATIONS  TO  THE  MODEL 


The  detailed  model  contained  in  this  report  is  an  industry  wide  representation  of  state-of- 
the-art  VLSI/VHSIC  CMOS  reliability.  It  is  recognized,  however,  that  the  best  reliability 
predictions  are  accomplished  based  on  empirical  reliability  data  from  a  specific  fabrication  process. 
This  empirical  data  is  ideally  field  failure  rate  experience  but  can  also  be  life  test  results. 

As  mentioned  in  the  presentation  of  the  detailed  model,  there  is  a  correction  factor  tcc, 

which  can  be  used  to  modify  the  model  as  more  empirical  data  does  become  available.  At  the 
writing  of  this  report,  ttc  is  one  (1)  by  definition  since  all  data  available  was  used  in  the  derivation 

of  the  model  and  therefore  on  average,  the  observed  data  equals  the  predicted  data. 

The  Ttc  factor  can  therefore  be  used  for  the  following  purposes: 

(1)  To  modify  the  model  as  VHSIC  field  experience  data  becomes  available. 

'  (2)  To  modify  the  model  for  a  particular  fabrication  process  based  on  the  availability  of 
high  quality  life  tests. 

The  second  purpose  above  should  only  be  used  in  the  absence  of  defect  density  data  when 
2 

the  dofault  value  (Xq/Xs)  is  used.  Additionally,  either  of  these  methods  should  only  be  used 
when  there  is  statistically  significant  amounts  of  high  quality » lata  available. 


The  method  of  calculating  is  a  straightforward  geometric  mean  of  observed/predicted 
ratios: 


1 
n 

where:  \  «  observed  failure  rate 
Xp  »  predicted  failure  rate 
n  «  number  of  failure  rate  observances 
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The  predicted  failure  rate  should  be  an  average  value  over  the  time  interval  the  observed  data 
was  taken.  All  other  inputs  should  be  as  close  to  the  actual  values  of  the  test  as  possible. 

A  manufacturer  should  also  be  given  the  opportunity  to  adjust  the  failure  rates  for  the 
individual  failure  mechanisms  in  the  event  adequate  data  on  specific  failure  mechanisms  exists. 
This  should  be  done  only  if  there  is  a  large  quantity  of  failures  which  have  been  failure  analyzed  to 
determine  the  cause  of  failure.  If  this  option  is  chosen,  all  failures  must  be  accounted  for  and 
categorized  into  one  of  the  mechanisms  in  the  present  model.  To  accomplish  this,  the  methodology 
described  previously  in  this  report  for  modeling  the  early  life  oxide,  metal,  contamination,  and 
miscellaneous  failure  mechanisms  should  be  used. 
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12.0  CONCLUSIONS  AND  RECOMMENDATIONS 


12.1  CONCLUSIONS 

A  reliability  prediction  model  has  been  developed  for  CMOS  VHSiryVHSIC-Like  devices 
from  analysis  and  failure  rate  modeling  of  specific  failure  mechanisms.  To  quantify  the  failure 
rates  of  each  of  these  mechanisms,  a  database  was  built  which  contains  life  test  and  environmental 
test  results.  Since  this  database  was  built  from  many  manufacturers  data,  the  failure  rate 
predictions  are  industry  wide  representative  values,  and  will  vary  from  manufacturer  to 
manufacturer.  This  effort  concluded  that  the  best  way  to  account  for  these  differences  is  to  use 
actual  defect  densities.  It  is  believed  that  the  use  of  actual  defect  densities,  if  properly  measured, 
will  result  in  predicted  reliability  values  which  are  more  precise  and  accurate  than  conventional 
regression  type  prediction  models. 

Derived  from  the  detailed  model,  a  simpler  "short  form"  model  was  developed  with  the 
understanding  that  systems  engineers  often  need  a  quick  reliability  prediction  tool  with  easily 
accessible  input  parameters. 

This  effort  was  the  first  in  support  of  the  MIL-HDBK-217  VLSI  integrated  circuit 
reliability  prediction  models  that  deviated  from  the  traditional  statistical  analysis  of  field  data.  A 
combination  of  physics  of  failure  information,  life  test  results,  screening  results,  and  test  structure 
data  was  used  to  achieve  the  study  objectives.  These  objectives  were  not  only  to  develop  a 
reliability  model  for  VHSIC/VHSIC-Like  CMOS  devices,  but  also  to  develop  a  methodology  that, 
if  successful,  can  be  effectively  used  in  a  timely  manner  to  develop  reliability  prediction  models  for 
emerging  microcircuit  technologies  , 

The  authors  believe  tiiat  the  detailed  model  presented  in  this  report  is  generally  more 
accurate  and  more  sensitive  to  the  fabrication  and  stress  variables  that  truly  effect  device  reliability. 
The  tradeoff  for  this  improvement  is  die  complexity  of  the  failure  rate  equations  themselves,  which 
if  done  by  hand  can  be  very  time  consuming.  For  this  reason,  it  is  desirable  to  computerize  the 
prediction  model,  thus  avoiding  tedious  calculations.  The  short  for  model  developed,  although 
easier  to  use,  is  expected  to  yield  less  precise  predictions.  Although  less  accurate  than  the  detailed 
model,  data  lias  been  presented  indicating  that  the  short  form  model  precision  is  approximately 
equivalent  ^traditional  regression  analysis  models. 
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12.2  RECOMMENDATIONS 


DTRI/Honeywell  recommends  that  the  model  contained  herein  (the  detailed  version,  short 
form,  or  both)  be  incorporated  into  MIL-HDBK-217.  The  authors  believe  that  this  model  currently 
represents  the  best  available  general  purpose,  small  feature  size  CMOS  device  reliability  prediction 
methodology.  It  is  also  recommended  that  users  collect  the  data  necessary  for  use  of  the  detailed 
model,  since  it  is  more  accurate  than  the  short  form  model  Also,  by  exercising  the  detailed  model 
with  actual  data,  feedback  can  be  obtained  to  determine  if  this  modeling  approach  can  be  accurately 
extended.  „j  other  technologies,  and  for  future  device  types. 

It  is  also  recommended  that  more  attention  be  given  to  collecting  accurate  field  failure  rate 
information  for  these  device  types  as  part  of  a  government  sponsored  program.  This  data  should 
then  be  submitted  to  a  central  repository  of  data  such  as  the  Reliability  Analysis  Center,  so  that  it 
would  be  available  for  M3LrHDBK-217  model  development  efforts. 

There  are  also  many  logical  follow  on  studies  to  this  effect  which  would  enhance  the 
knowledge  base  of  VLSI/VHSIC  reliability  characterization.  The  most  obvious  possibly  is  to 
collect,  when  available,  field  failure  rate  and  cause  data  and  refine  the  model  accoidingly.  In 
addition  to  field  reliability  data,  life  test  and  screening  data  should  continue  to  be  collected  and 
analyzed  so  that  a  comprehensive  database  can  be  built 

Another  very  useful  effort  would  be  the  extension  of  the  methodology  developed  herein  to 
bipolar  VHSIC  and  VLSI  technologies  since  much  of  the  modeling  douc  in  this  model  could  be 
applied  to  bipolar  devices. 


One  goal  of  this  effort  was  to  relate  failure  rate  prediction  to  efforts  such  as  the  Generic 
Qualification  Program.  Although  a  certain  degree  of  success  was  attained  toward  this  goal,  it  was 
ultimately  concluded  that  there  is  currently  a  lack  of  standardization  throughout  the  industry  in 
quantifiable  parameters  that  could  be  used  in  a  reliability  model.  Since  there  are  various 
standardization  efforts  jnderway  in  this  area,  die  results  of  these  efforts  should  be  investigated  for 
use  as  reliability  indicators.  Furthermore,  a  very  useful  related  effort  would  be  to  define  standard 
methodologies  for  the  quantification  of  reliability  characteristics. 
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SURVEY  LETTER  AND  FORM 


R 

commitment  to  excellence 


IIT  Research  Institute 
Beeches  Technical  Campus 
Rte.  26N 

Rome,  New  York  13440 
315/336-2359 


April  16,  1987 


Mr.  Jeff  Katz 

Director  of  Marketing  ASIC  Products 

Intel  Corporation 

3065  Bowers  Avenue 

Mail  Stop:  SCI-5 

Santa  Cara,  CA  95051 

Dear  Mr.  Katz: 

IIT  Research  Institute  (IITRI)  has  recently  initiated  a  study  under 
contract  to  Rome  Air  Development  Center  (RADC  Contract  Number  F30602-86-C-0261) 
to  develop  VHSIC  and  VHSIC-like  CMOS  reliability  prediction  models.  Since 
meaningful  amount.,  of  empirical  field  reliability  data  are  not  expected  to 
be  available  for  use  in  model  development,  these  prediction  methodologies 
will  be  based  primarily  on  information  available  during  circuit  fabrication 
such  as  test  structures,  yield,  and  screening  information.’ 

The  intent  of  this  study  is  to  correlate  this  information  with  field 
reliability  performance  for  each  failure  mechanism  of  interest  and  to  combine 
these  mechanisms  into  a  useable  prediction  model  form,  ultimately  for  inclusion 
in  MIL-HDBK-217,  "Reliability  Prediction  of  Electronic  Equipment". 

To  achieve  the  goals  of  this  program  and  to  develop  prediction  models 
valuable  to  users,  IITRI  needs  information  on  a  wide  variety  of  part  types 
and  fabrication  lines.  We  are  currently  seeking  data  and  information  from 
organizations  involved  in  VHSIC/VHSIC-like  fabrication  in  the  following  areas; 
field  failure  rates,  life  test  results,  screening  data,  yield  data,  and  test 
structures  data.  Any  information  your  organization  can  provide  in  these 
areas  will  greatly  assist  us  in  achieving  our  goals. 

Any  information  submitted  to  IITRI  for  use  in  this  effort  will  be  kept 
strictly  proprietary,  without  traceability  to  data  submittors. 

We  feel  It  will  be  very  Important  to  the  electronics  industry  to  have 
accurate  reliability  prediction  models  for  VLSI  and  VHSIC  devices,  and  that 
we  can  provide  good  models  if  cooperation  is  obtained  from  the  semiconductor 
industry.  Please  fill  out  the  attached  survey  form  and  send  back  to  IITRI, 
at  which  time  one  of  our  representatives  will  call  to  further  pursue  these 
matters. 


If  there  are  any  questions,  please  call  Alex  Recchio  at  (315)  336-2359. 
IITRI  very  much  appreciates  your  cooperation  in  completing  the  enclosed  survey 
and  looks  forward  to  your  participation  in  this  effort. 


Sincerely, 


Will iam  K.  Denson 
Project  Engineer 


()}y  $eJLc 

Alex  Recchio 
Senior  Data  Specialist 


WKD/AJR: jev 
Attachment 


VLSI/VHSIC  RELIABILITY  SURVEY 


Name:  _ 

Tide:  _ 

Organization:. 

Division: _ 

Address: _ 


Phone  Number: _ _ _ _ _ 

(1)  Does  your  organization  manufacture  or  use  MOS  VLSI  or  VHSIC  circuits? 


If  so,  please  outline  their  characteristics 
•  Device  type  (Memory,  Microprocessor,  etc.) 


•  Feature  size  (Gate  Length,  Metal  Width) 


•  Complexity  (Approx.  Number  of  Gates,  Transistors) 


*  Packaging  (Type,  Number  of  Pins) 


(2)  Please  check  below  die  type(s)  of  data  that  your  organization  lias  on  the  above  device(s): 

_  Field  Failure  Rates 

_  Life  Test  Results 

_ _  Screening  Daw  (of  any  Type) 

_  Failure  Analysis  Data 
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If  Failure  Analysis  Data  is  available,  which  failure  mechanisms/modes  are  observed  (along 
with  relative  percentages  of  occurrence)? 

Percentage 


_  Electromigration  _ 

_  Dielectric  Breakdown  _ 

_  Soft  Errors  _ 

_  Parametric  Drift  _ 

_  Hot  Electrons  _______ 

_  Latch  Up  _ 

_____  Electrical  Overstress  _______ 

_  Package  Related  ______ 

_  Quality  control  monitor  data  (test  structures,  reliability  evaluation 

monitors,  etc.) 

If  yes,  which  types  are  available? 


(3)  Can  the  above  data  be  made  available  to  I1TRI  for  tins  study? 


(4)  Could  you  please  provide  point*of-contact  (if  other  titan  addressee)? 


Name: 

Title: 

Phone: 


(3)  Remarks: 


APPENDIX  B: 


OXIDE  AND  METAL  DEFECT 
DENSITY  CALCULATIONS 


It  is  assumed  that  the  defects  are  random.  First  metal  layer  defect  density,  D0  met,  is  established 

with  an  interdigitated  meander,  shown  in  Figure  1.  The  meander  is  laid  out  with  the  minimum  first 
metal  line  width  and  space  dimensions  allowed  for  the  device  in  question.  The  minimum 
serpentine  length  should  be: 

Wt  ^max 

r 

where  is  the  die  area  of  the  maximum  slse  device  to  be  manufactured  in  the  same 
technology,  and  b  the  minimum  pitch  allowed  by  the  layout  rales  for  the  first  metal  layer. 

Data  from  a  minimum  of  50  randomly  selected  test  structure  sizes  must  be  used. 


Three-  types  of  test  measurements  are  performed  on  these  structures.  First,  for  the  contact  test, 
two-probe  resistance  measurements  a re  made  between  nodes  1  and  8, 2, 3, 4,  5, 6  and  7.  Second, 
four-probe  resistance  measurements  are  made  on  the  serpentine  with  current  for  ced  at  nodes  2  and 
7  in  Figure  1,  and  voltage  sensed  at  nodes  3  and  6.  Third,  for  the  bridging  test,  leakage  between 
adjacent  metal  lines  is  measured  by  forcing  a  voltage  at  the  node  formed  by  connecting  together 
nodes  1, 4, 5,  and  8  in  Figure  1  and  measuring  a  leakage  with  a  current  meter  connected  between 
ground  aid  the  node  formed  by  connecting  together  nodes  2, 3, 6  and  7  in  Figure  1.  The  pass 
range  for  the  two-probe  resistance  measurements  is  between  0.5x  nominal  calculated  resistance  and 
1.5x  nominal  calculated  resistance.  The  pass  range  for  the  serpentine  measurement  is  between 
0.5x  nominal  calculated  serpentine  resistance  and  1.5x  nominal  resistance.  1’lrc  pass  range  for  the 
leakage  resistance  between  adjacent  metal  lines  is  between  10  times  the  nominal  resistance  of  tire 
serpentine  and  1.0E18  ohms. 


Yields  are  calculate?!  in  the  following  way.  For  contact  yield. 


Yc  =  l 


N, 


’cp 


cp 


where  Nq,  is  the  number  of  sites  that  pass  the  contact  test,  and  Ncf  is  the  number  of  sites  that  fail 
the  contact  test  If  Yc  is  less  than  0.5  (50%)  then  the  test  must  be  performed  again.  For  serpentine 

and  bridging  yield, 

N ch  n 

where  of  the  sites  that  pass  the  contact  test  Ws^p  is  the  number  of  sites  that  pass  the  serpentine  and 
the  bridging  tests,  and  is  the  number  of  sites  that  fail  either  the  serpentine  or  the  bridging 
tests. 


The  defect  density  is  calculated  from  the  yield  with  the  following  equation: 

n 

It  is  assumed  that  the  defects  are  random.  Gate  oxide  defect  density,  DCk  ox,  is  estabUshed  with  a 
capacitor  test  structure,  shown  in  Figure  2.  lire  gate  oxide  area  should  be: 


83  W  %x 


where  Ainax  is  the  die  area  of  Use  maximum  size  device  to  be  manufactured  in  the  same  technology 

as  the  device  in  question.  Data  from  a  minimum  of  50  randomly  selected  tost  structure  sices  must 
be  used 


Two  types  of  test  measurements  are  performed  on  these  structures.  First,  for  the  contact  test,  two- 
probe  resistance  measurements  are  made  between  nodes  1,  2,  3,  4,  5  and  6.  Second,  for  the 

leakage  test,  leakage  between  gate  and  the  n-  or  p-  channel  silicon  is  measured  by  forcing  a  voltage 
(Vdd)  at  the  node  formed  by  connecting  together  nodes  1, 2, 3  and  4  in  Figure  2  and  measuring  a 

leakage  with  a  current  meter  connected  between  ground  and  the  node  formed  by  connecting 
together  nodes  5  and  6  in  Figure  2.  The  pass  range  for  the  two-probe  resistance  measurements  is 
between  0.5  X  nominal  calculated  resistance  and  1.5  x  nominal  calculated  resistance.  The  pass 
range  for  the  leakage  resistance  the  gate  and  silicon  is  between  1  0E-10  ohms  and  1.0E18  ohms. 

Yields  are  calculated  in  the  following  way.  For  contact  yield, 

Ncp 

YC-  bfCp  +  Nc7 

where  Nqj  is  the  number  of  sites  that  pass  the  contact  test,  an  d  N^p  is  the  number  of  sites  that 
fail  the  contact  test  If  Yc  is  less  than  0.5  (50%)  then  the  test  must  be  performed  again.  For  oxide 

leakage  yield, 

NCap  P 

y°*“ScJp*c5f 

where  of  the  sites  that  pass  the  contact  test,  N^ap  p  is  the  number  of  sites  that  pass  t h«  leakage 
test,  and  Nq^  p  is  the  number  of  sites  that  fail  the  leakage  tests. 


The  defect  density  is  calculated  from  the  yield  with  the  following  equation: 


D, 


»ox 


Hn  Ycv) 

— — 

‘ox 
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FIGURE  1: 

INTERDIGITA1ED  MEANDER 


FIGURE  2: 

CAPACITOR  TEST  STRUCTURE 

B-8 


APPENDIX  C: 
FIELD  DESCRIPTIONS 


PART  NUMBER  is  the  part  identification  number  of  the  current  device.  This  field  is  not 
required  for  calculation  of  the  part  This  field  is  required  for  storage  of  parameters. 
There  are  no  constraints  on  the  contents  of  this  field. 

PART  DESCRIPTION  is  the  generic  part  type  of  the  current  device.  This  field  is  not 
required  for  calculation  of  the  part  or  storage  of  parameters.  The  contents  of  this  field  is 
selected  from  a  look  up  list,  to  which  new  entries  may  be  added. 

MANUFACTURER  is  the  manufacturer  of  the  current  device.  This  field  is  not  required 
for  calculation  of  the  part  cr  storage  of  parameters.  The  contents  of  this  field  is  selected 
from  a  look  up  list,  to  which  new  entries  may  be  added. 

PACKAGE  TYPE  is  the  type  of  package  enclosure  of  the  current  device.  This  field  is 
required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents  of  this  field  is 
selected  faun  the  look  up  list,  which  may  not  be  added  to. 

NUMBER  OF  PINS  is  the  total  number  of  pins  (including  pins  not  internally  connected) . 
on  the  current  device.  This  field  is  required  for  calculation  of  the  part  and  storage  of 
premeters.  Tks  contents  of  tliis  field  is  catered  directly  and  must  be  greater  then  zero. 

COMPLEXITY  is  the  gate,  bit  or  transistor  count  of  the  current  device.  The  UNITS  are 
MG"  for  gate,  “BM  foe  bit,  and  T**  for  transistor.  Tills  field  is  not  required  for  calculation 
of  the  pan  or1  storage  of  The  contents  of  this  field  is  entered  directly. 

DEVICE  TYPE  is  the  basic  family  to  which  the  current  device  belongs.  Tliis  field  is 
required  for  calculation  of  tire  part  and  storey  of  paremetets.  The  contents  of  this  field  is 
selected  from  a  k»k  up  list,  which  may  not  be  added  to. 

DIE  AREA  is  the  size,  in  square  centimeters,  of  the  die  of  the  current  device.  This  field 
is  required  for  calculations  of  the  part  and  storage  of  parameters.  The  contents  of  this 
field  ts  entered  directly.  If  the  DIE  AREA  is  unknown,  use  99.00. 


THERMAL  COEFFICIENT  is  the  Junction  to  Ambient  thermal  resistance  of  the  device. 
This  field  is  required  for  calculation  of  the  pan  and  storage  of  parameters.  The  contents 
of  this  field  is  entered  directly. 
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•  FEATURE  SIZE  is  the  size,  in  microns,  of  the  smallest  feature  of  the  device.  For 
example,  VHSIO  Phase  1  is  1.25  microns.  This  field  is  required  for  calculation  of  the 
part  and  storage  of  parameters.  The  contents  of  this  field  is  entered  directly.  If  the 
FEATURE  SIZE  is  unknown,  use  99.000. 

•  DEFECT  DENSITY  is  the  number  of  critical  defects  per  square  centimeter.  This  field  is 
required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents  of  this  field  is 
entered  directly.  If  the  actual  DEFECT  DENSITY  for  the  feature  size  is  unknown,  use 
9999.00. 

•  MANUFACTURING  PROCESS  identifies  the  device  as  being  manufacturered  in 
accordance  with  the  Generic  Qualification  Program  (QML)  or  listed  on  the  M-38510 
QPL.  This  field  is  required  for  calculation  of  the  part  and  storage  of  parameters.  The 
contents  of  this  field  is  selected  from  a  look  up  list,  which  may  not  be  added  to. 

•  ESD  SUSCm’ffilLIYY  LEVEL  is  the  worst  case  threshold  voltage  of  the  device  relative 
to  the  100pF,  1500  OHM  Model  in  accordance  with  M1L-STD-883B,  Method  3015. 
This  field  is  required  for  calculation  of  the  part  and  storage  of  parameters.  The  content? 
of  tills  field  is  entered  directly.  If  a  RANGE  of  SUSCEPTIBILITY  is  known,  use  the 
MIDPOINT  voltage  of  the  RANGE.  If  unknown,  use  99999. 


•  METAL  TYPE  is  the  metallization  material  used  This  field  is  required  for  calculation  of 
the  part  and  storage  of  parameters.  The  contents  of  this  field  is  selected  from  a  look  up 
list,  which  may  be  not  added  to. 

•  SCREEN  TYPE  is  the  amount  of  screening  the  (tevkic  has  been  subjected  m.  litis  field 
is  required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents  of  tills  field 
is  selected  from  a  look  up  list,  which  may  ms  i»  sddedto. 

•  SCREEN  DURATION  is  the  length  of  the  screen  perforated  on  tire  device  la  millions  o; 
hours.  This  field  is  required  for  calculation  of  the  part  and  storage  of  parameters.  Hie 
contents  of  this  field  is  entered  directly  and  must  be  greater  then  zero. 
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•  SCREEN  TEMPERATURE  is  the  ambient  temperature  of  the  screen  performed  on  the 
device,  in  degrees  centigrade.  This  field  is  required  for  calculation  of  the  part  and  storage 
of  parameters.  The  contents  of  this  field  is  entered  directly  and  must  be  greater  then  zero. 

•  SCREEN  POWER  LEVEL  is  the  power  dissipated,  in  watts,  by  the  device  in  the  screen 
performed.  This  field  is  required  for  calculation  of  the  part  and  storage  of  parameters. 
The  contents  of  this  field  is  entered  directly  and  must  be  greater  then  zero. 

•  AMBIENT  TEMPERATURE  is  the  temperature  which  the  device  is  exposed  to,  in 
degrees  centigrade.  This  field  is  required  for  calculation  of  the  part  and  storage  of 
P'uameters.  The  contents  of  this  field  is  entered  directly  and  must  be  greater  then  zero. 

•  ACTUAL  POWER  LEVEL  is  the  power  dissipated  by  the  device  in  its  intended 
application,  in  watts.  This  field  is  required  for  calculation  of  the  part  and  storage  of 
parameters.  The  contents  of  this  field  is  entered  directly  and  must  be  greater  then  zero. 

•  CURRENT  DENSITY  is  the  average  absolute  value  current  density  of  the  majority  of 
metal  runs,  in  10E06  Amps/sq  cm.  This  field  is  required  for  calculation  of  the  part  and 
storage  of  parameters.  The  contents  of  this  field  is  entered  directly  and  must  bo  greater 
then  zero. 


•  ELECTRIC  HELD  is  the  average  electric  field  value  in  the  gate  oxide,  in  megavolts/cm. 
This  field  is  required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents 
of  this  field  is  catered  directly  and  must  be  greater  then  zero. 

•  ENVIRONMENT  is  the  application  environment  the  device  is  operating  in.  This  field  is 
required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents  of  thisfieldis 
selected  from  a  look  up  list,  which  may  not  be  added  to. 

•  RELATIVE  HUMIDITY  is  the  average  relative  humidity  expected  in  the  device 
environment  This  field  is  required  for  calculation  of  the  part  and  ■  torage  of  parameters 
for  nomhemretic  packages.  Tire  contents  of  this  field  is  entered  directly  and  must  be 
greater  then  zero. 
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•  SUBSTRATE  CURRENT  is  the  current,  in  milliamperes,  the  de'tice  substrate  carries. 
This  field  is  required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents 
of  this  field  is  entered  directly  and  must  be  greater  then  zero.  If  unknown,  enter 
9999.000000. 

*  DRAIN  CURRENT  is  the  current,  in  milliamperes,  the  device  drain  carries.  This  field  is 
required  for  calculation  of  the  part  and  storage  of  parameters.  The  contents  of  this  field  is 
entered  directly  and  must  be  greater  then  zero.  If  unknown,  enter  9999.000000. 

♦  DUTY  CYCLE  is  the  percentage  of  total  time  the  device  is  operated.  For  example,  50% 
operation  — >  duty  cycle  =  0.50.  This  field  is  required  for  calculation  of  the  part  and 
storage  of  parameters.  The  contents  of  this  field  is  entered  directly  and  must  be  greater 
then  zero. 

•  SIGMA  OXIDE  is  dielectric  breakdown  failures  observed  from  time  to  failure  data  on  the 
oxide  or  similar  oxide.  This  field  is  required  for  calculation  of  the  part  and  storage  of 
parameters.  The  contents  of  this  field  is  entered  directly  and  must  be  greater  then  zero. 
If  unknown,  use  9999.000. 

•  SIGMA  METAL  is  tlw  sigma  for  elccfco-ndgradon  failures  observed  from  time  to  failure 
data  on  the  metal,  or  similar  metal,  of  the  device.  This  field  is  required  for  calculation  of 
the  part  and  storage  of  parameters.  The  contents  of  this  field  is  entered  directly  and  must 
be  greater  then  zero. 

*  SIGMA  HOT  CARRIER  is  the  sigma  for  hot  carriers  failures  observed  from  time  to 
failure  data  oti  the  same  carrier,  or  similar  devices.  This  field  is  required  for  calculation 
of  the  part  and  storage  of  parameters.  The  contents  of  this  field  is  entered  directly  and 
must  bo  greater  then  zero. 
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MISSION 


Rome  Air  Development  Center 


RADC  plans  and  executes  research,  development,  test  and 
selected  acquisition  programs  in  support  of  Command,  Control, 
Communications  and  Intelligence  (CiI)  activities.  Technical  and 
engineering  support  within  areas  of  competence  is  provided  to 
ESD  Program  Offices  (POs)  and  other  ESD  elements  to 
perform  effective  acquisition  of  C3/  systems.  The  areas  of 
technical  competence  include  communications,  command  and 
control,  battle  management  information  processing,  surveillance 
sensors,  intelligence  data  collection  and  handling,  solid  state 
sciences,  electromagnetics,  and  propagation,  and  electronic 
reliability  /maintainability  and  compatibility. 


